Editorial

Eureka 65 


Following suit from the opening of several recent issues, I
would like to begin by apologising for the late publication
of Eureka 65. As usual, several organisational, logistical,
and printing headaches were to blame. In spite of this, editing
Eureka has been a great experience, and I hope all of our
readers enjoy the end result.

In this issue, we have continued our custom of publishing
numerous new, high-quality mathematical articles from various
fields. However, we have also included a selection of articles
by noted mathematicians from issues of long ago that we
felt were worth highlighting once more. We have additionally
sought to increase the other content within the issue. As a
result, Eureka 65 sees the return of several joke articles and
features that have proved popular in the past. We hope that
the outcome is an issue with something for everyone.

This year, we have also begun the arduous task of digitising
our old issues. Consequently, a wide selection of issues will
soon be available for instant viewing on the Archimedeans'
website.

Finally, no editorial would be complete without giving thanks 
to those who made this issue possible. Particular gratitude 
goes to all of our sponsors, writers, the Archimedeans, and of 
course the readership. I hope you all enjoy Eureka 65! 


Editor 

Michael Grayling (Sidney Sussex) 

President 

Yanni Du (Pembroke) 




Michael Grayling 
April, 2018 
Einstein Ring

An Einstein ring is the deformation of the
light from a source (such as a galaxy or star)
into a ring through gravitational lensing of
the source's light by an object with an
extremely large mass (such as another galaxy
or a black hole).
Every year is a great year for the Archimedeans and this year has not been any different. Since forming
the 2017-2018 committee at the end of Lent term 2017, we've had many great, well-attended events.

In Easter term we had our first Archimedeans BBQ summer party, organized by our Events Manager, Yuen
Ng. Thanks to his efforts, our members had a lovely chilled afternoon in the sun, relaxing once
exams were over. To celebrate the end of exams we also had our annual punting trip, where for once the
weather couldn't have been nicer!


To kick off the start of the academic year, Professor Béla Bollobás from Trinity College gave us our first
speaker event of the term: "Counting with Flair". After a well-attended lecture, he was kind enough to
invite all of us to his house for food and drink. The next talk of the term was by Vicky Neale, who came
from Oxford to give us her talk on "What is the purpose of a proof?". After that, Perla Sousi and
Imre Leader both gave us fantastic talks. Last term we also had Tom Crawford, from the Naked Scientists,
come to give a talk on maths communication. It was informative and interesting to hear about his efforts
to make maths accessible and interesting to the general public.

Yuen and Jin put together a great program in terms of social events. We had our classic pizza and board 
games evenings and pub quiz. They also organised the second integration contest for the Archimedeans 
which had a great turnout; it was great to see so many people interested in doing 2 hours of integration
on a Saturday afternoon! 

In the coming term, the last for the current committee, we've got a busy schedule lined up. Towards the
end of term we will have the highly anticipated annual dinner taking place at the DoubleTree Hilton. We're
expecting just short of 100 students and lecturers to attend this black-tie three-course meal to celebrate another
year of the Archimedeans.

In terms of speakers we have Professor Sarah Whitehouse coming from Sheffield to give our first lecture 
of term on "Associativity in topology". We've then got Paul Cook giving us a lecture on the Mathematics of 
Juggling, followed by Beth Romano, Gabriel Paternain, Eric Lauga and Nilanjana Datta. 

As my time as the president comes to an end, I would just like to say thank you to the amazing committee
who have done an incredible job for the society so far. In particular, Jin and Yuen have done an amazing
job in socializing mathmos, and Eve has managed to fill our term with speakers from all over the country.
Dexter has redesigned our website, and Zheneng has done a fantastic job in advertising our events,
with our newly designed posters filling the CMS with colour. I would also like to thank all of our
subscribers and members, as without them this society wouldn't exist.


The Committee 2017/18 


President 

Yanni Du (Pembroke) 


External Secretary 

Eve Pound (Murray Edwards) 


Events Managers 
Image Analysis: Extracting
Multifaceted Data from Images 

Joana Grah 

PhD Student, University of Cambridge 


In our swiftly developing and globalised world, huge amounts of data are produced, processed and analysed on a
daily basis. Compared with the amount of data available only a few decades ago, today's volume is many times
larger, and new challenges have arisen in terms of handling this ‘big data’. A whole new academic field has
been established: data science. It incorporates knowledge from a variety of different research areas including computer
science, applied mathematics and statistics. One particular branch, which is an entire research area by itself and is not
at all limited to analysis of big data, is image processing. Images of various types are omnipresent and in the times of
Facebook, Instagram and similar social media, everyone is both producing and consuming them almost constantly. Our
brain interprets a picture we are looking at within milliseconds and in the same amount of time individual emotions and
memories are triggered. The idiom ‘a picture is worth a thousand words’ stresses the fact that a single image is able to
convey a multitude of information and messages.



But how can images be viewed and treated in a formal mathematical way? We can consider an image to be a continuous
function mapping from the image domain, which is typically rectangular, to a set of intensity values. In the case of
colour images the values are three-dimensional vectors containing intensity information for the red, green and blue colour
channels, assuming the standard RGB colour space.

Given this formulation, we are then able to perform mathematical operations and calculations on the image itself and
apply a variety of methods and theory that have been developed, especially in the field of functional analysis and partial
differential equations.

In order to process an image on a computer, we need to discretise the continuous function. We can think of this process
as dividing the image domain into a discrete uniform grid consisting of squares, each of which is made up of a constant 
intensity value. Those squares are called pixels. If the image domain is three-dimensional, the image is subdivided into 
cubes referred to as voxels. 

Zero intensity usually corresponds to black. In 8-bit greyscale images, intensity values range from 0 to 255 ($256 = 2^8$
different values), where 255 corresponds to white. Modern microscopy devices commonly produce 16-bit images
containing $2^{16} = 65536$ different intensity values. For computations, the intensity range is usually normalised such that values
lie in the interval [0,1]. In the above figure, we zoom into a greyscale image twice until we are able to recognise the pixel
structure and observe that the intensity values in each pixel range between zero and one and the higher the number, the
brighter the shade of grey.

In recent years and decades, a large variety of different mathematical methods for analysing image functions and
reconstructing images from measurement data have been established. As mentioned above, many techniques are borrowed
from the theory of partial differential equations, functional analysis, statistics and even topology. One particular field that our
working group, Cambridge Image Analysis, is much concerned with is variational methods. They are used in order to
retrieve images from given observed data or measurements. For instance, we can reconstruct a magnetic resonance image
of the brain from possibly sub-sampled Fourier space measurements or a computed tomography image of the abdomen
based on x-ray measurements taken from different angles. In each case we need to make different assumptions on the
forward model, i.e. which operations are needed to model the pathway from raw measurement data to the data we are
given, and also we need to incorporate certain prior assumptions on the image we want to retrieve, e.g.
piecewise-constant regions subdivided by sharp edges. Independent of the application, all variational models have a specific structure
in common. They always consist of two parts: a data term and a regularisation term.

If we denote our given data by $f$, the forward operator by $K$ and the image we would like to obtain by $u$, we can write
a variational problem as follows:

$$\min_u \{ D(f, Ku) + \alpha R(u) \}.$$

The aim is to minimise the energy functional, which is the sum of the data term $D$ and the regulariser $R$ weighted by
a constant factor $\alpha > 0$, with respect to $u$. The former penalises discrepancy between $f$ and $Ku$, usually modelled by a
suitable norm of the difference between the two. On the other hand, the latter enforces prior assumptions on $u$, for
example smoothness in a certain sense. It is crucial to find a suitable weight $\alpha$ to balance the two terms. Let us think about
how we solved optimisation problems back in secondary school. The basic recipe was calculating the first derivative and
subsequently setting it to zero. We are doing exactly the same in order to solve variational problems, except for the fact
that the energy functionals at hand are typically not differentiable in a classical way. Most of the time, we need to find
good approximations.
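
As a rough illustration of this recipe, the sketch below treats a small one-dimensional denoising problem. Everything specific in it is an assumption made for the example and is not taken from the article: the forward operator $K$ is the identity, the data term $D$ is half the squared Euclidean distance, the regulariser $R$ is half the sum of squared differences of neighbouring values (a differentiable choice, unlike many regularisers used in practice), and the minimiser is approached by plain gradient descent.

```python
def energy(u, f, alpha):
    """E(u) = 0.5*sum((u_i - f_i)^2) + alpha*0.5*sum((u_{i+1} - u_i)^2)."""
    data = 0.5 * sum((ui - fi) ** 2 for ui, fi in zip(u, f))
    reg = 0.5 * sum((u[i + 1] - u[i]) ** 2 for i in range(len(u) - 1))
    return data + alpha * reg


def gradient(u, f, alpha):
    """First derivative of the energy with respect to every entry of u."""
    g = [ui - fi for ui, fi in zip(u, f)]
    for i in range(len(u) - 1):
        d = u[i + 1] - u[i]
        g[i] -= alpha * d
        g[i + 1] += alpha * d
    return g


def denoise(f, alpha=2.0, steps=200):
    """Gradient descent started at the data; the step size 1/(1 + 4*alpha)
    is small enough that the energy never increases for this quadratic."""
    step = 1.0 / (1.0 + 4.0 * alpha)
    u = list(f)
    for _ in range(steps):
        g = gradient(u, f, alpha)
        u = [ui - step * gi for ui, gi in zip(u, g)]
    return u


# Hypothetical noisy step signal, chosen only for illustration.
f = [0.0, 0.1, -0.1, 0.05, 1.0, 0.9, 1.1, 0.95]
u = denoise(f)
```

A larger `alpha` pulls neighbouring values closer together at the cost of fidelity to `f`, which is the balancing act between the two terms described above.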

Although the methods in their basic form are always the same, the underlying imaging problems can be manifold. 
Real-world applications range from (bio)medicine (MRI, CT, PET, SPECT, ultrasound, microscopy) to plant sciences, 
forensics, ornithology and arts restoration (see http://www.damtp.cam.ac.uk/research/cia/ to find out more). 

[Figure: original image $u$ (left); $G_{\sigma=5} * u$ (centre); $G_{\sigma=10} * u$ (right)]



Let us finish this article with two prominent examples of image processing. First, the concept of convolution can be 
translated to images. In the discrete setting, we can regard a convolution as applying a convolution kernel, which can 
be written as a discrete filter mask containing values, and overlaying this mask onto every pixel in the original image. 
We start on the top left and go from top to bottom and from left to right. The dimensions of the filter mask are usually 
much smaller than the actual image size, e.g. 5 × 5 pixels. Applying this mask to a pixel, and equivalently calculating
the convolution at this point, corresponds to multiplying the values of the mask by the pixel values it is overlaying with, 
the considered pixel coinciding with the centre of the mask, then summing everything up. A Gaussian filter kernel will 
have its highest value in the centre and decreasing values towards the edge of the mask. Applying the latter to an image 
corresponds to smoothing it, as we can regard the convolution with a Gaussian kernel as a pixel-wise weighted averaging 
operation. The above picture shows an RGB colour image (left), the same image smoothed by convolution with a
Gaussian kernel with standard deviation equal to 5 (centre) and 10 (right), respectively. We can observe that the image becomes
more blurred from left to right and that we lose information on fine structures such as leaves and blades of grass while 
larger structures such as the bridge and the branches and the crowns of the trees are still distinguishable. Performing 
convolution with a Gaussian kernel is a quick and easy way to remove small amounts of noise in an image. 
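
The overlay-multiply-sum procedure described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function names are hypothetical, the image is a greyscale list of rows, the mask is normalised to sum to one, and pixels beyond the border are replaced by the nearest edge pixel, a boundary rule the article does not specify.

```python
import math


def gaussian_kernel(size, sigma):
    """size x size filter mask sampled from a Gaussian, normalised to sum to 1."""
    c = size // 2
    mask = [[math.exp(-((i - c) ** 2 + (j - c) ** 2) / (2.0 * sigma ** 2))
             for j in range(size)] for i in range(size)]
    total = sum(sum(row) for row in mask)
    return [[v / total for v in row] for row in mask]


def convolve(image, mask):
    """Centre the mask on every pixel, multiply and sum.
    The Gaussian mask is symmetric, so no kernel flip is needed.
    Out-of-range neighbours are clamped to the nearest edge pixel (assumption)."""
    h, w = len(image), len(image[0])
    c = len(mask) // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for i, row in enumerate(mask):
                for j, m in enumerate(row):
                    yy = min(max(y + i - c, 0), h - 1)
                    xx = min(max(x + j - c, 0), w - 1)
                    acc += m * image[yy][xx]
            out[y][x] = acc
    return out


# A 7 x 7 test image with a single bright pixel in the middle.
image = [[0.0] * 7 for _ in range(7)]
image[3][3] = 1.0
kernel = gaussian_kernel(5, 1.0)
blurred = convolve(image, kernel)
```

The single bright pixel is spread over its neighbourhood with weights given by the mask, which is the pixel-wise weighted averaging the text refers to.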







Edges constitute essential and important features in images. But how can we grasp them in a mathematical way? If we 
have a closer look at what prominent properties all edges have in common and how we can recognise them by eye, we 
note sharp changes in intensity. If we again think of secondary school and consider a one-dimensional function, abrupt
changes are reflected in high values of the derivative of this function. Images are however two-dimensional objects, so we
obtain a gradient in both x- and y-direction. The above figure shows the gradient magnitude (centre) of a greyscale image
(left) and we can indeed observe that high values of the former coincide with edges in the latter. On the right-hand side, 
we can notice that previous smoothing of the image by convolving it with a Gaussian kernel even leads to enhancement 
of major edges. 
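
A minimal sketch of the gradient magnitude, assuming central finite differences for the derivatives in the x- and y-directions (the article does not say which discretisation was used) and clamped neighbours at the border:

```python
import math


def gradient_magnitude(image):
    """Per-pixel length of the (x, y) gradient of a greyscale image.
    Derivatives are approximated by central differences; at the border the
    missing neighbour is replaced by the pixel itself (assumption)."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            x0, x1 = max(x - 1, 0), min(x + 1, w - 1)
            y0, y1 = max(y - 1, 0), min(y + 1, h - 1)
            gx = (image[y][x1] - image[y][x0]) / 2.0
            gy = (image[y1][x] - image[y0][x]) / 2.0
            out[y][x] = math.hypot(gx, gy)
    return out


# A vertical step edge: dark on the left, bright on the right.
step_edge = [[0.0, 0.0, 0.0, 1.0, 1.0, 1.0] for _ in range(4)]
magnitude = gradient_magnitude(step_edge)
```

For this test image the magnitude is non-zero only in the two columns on either side of the jump, so high values mark the edge, as described above.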

Edge localisation and information about first- and higher-order derivatives play a crucial role in state-of-the-art imaging
methods. Many different variational models have been successfully applied to a large variety of imaging problems, but
even with the advent of artificial intelligence and machine learning, computers are still not able to mimic the human visual
system and how it most impressively interacts with the brain.



Mathematics and Games 

P. M. Grundy 


In any game between two opponents playing alternately, in which the legality of moves is not governed by chance, it
is intuitively clear that there must be some best method of playing; and if both play correctly the result of the game is
determined by the initial position. In fact if, when it is A's turn, he cannot force a win then, however A moves, B can
follow with a move after which A again cannot force a win. By repeating these tactics B can at least secure a draw, which
should be the conclusion of the game unless B can force a win. This reasoning holds even for card games, with the snag
that there the players are not allowed to know the state of the game.

There is a remarkably simple theory for the game with piles of matches (or nuts, etc.) called Nim. The move consists of
taking any number ($\ge 1$) of matches from any one heap, and the player taking the last match wins.* The theory depends
on the operation of Nim Addition, denoted by $+_N$: when the numbers $x_1, x_2, \ldots$ on the heaps are expressed in the
scale of 2 as $x_i = \sum_j a_{ij} 2^j$ ($a_{ij} = 0$ or 1; $i = 1, 2, \ldots$) and $b_j = 0$ or 1 according as $\sum_i a_{ij}$ is even or odd,
the Nim Sum

$$x_1 +_N x_2 +_N \cdots$$

is defined to be $\sum_j b_j 2^j$. This operation obeys, like ordinary addition, the commutative and associative laws for
bracket-moving; and also has the fundamental property

$$x +_N x = 0 \quad (\text{any } x).$$

Now call the position $(x_1, x_2, \ldots)$ winning (W) if

$$x_1 +_N x_2 +_N \cdots = 0,$$

and losing (L) if not. Then the terminal state (all $x_i = 0$, when the game finishes) is W; and also this classification of
states obeys the characteristic recurrence-relations

(1)  from a non-terminal L it is always possible to move to a W;
     from a W it is impossible to move to a W.

A player who once moves to a winning position can evidently continue to do so at his subsequent moves, and so
eventually win.

Nim Addition Table

e.g. $3 +_N 6 = 5$

0 1 2 3 4 5 6 7 ..
1 0 3 2 5 4 7 6 ..
2 3 0 1 6 7 4 5 ..
3 2 1 0 7 6 5 4 ..
4 5 6 7 0 1 2 3 ..
5 4 7 6 1 0 3 2 ..
6 7 4 5 2 3 0 1 ..
7 6 5 4 3 2 1 0 ..
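
Nim Addition as defined above (add the binary digits column by column and keep only the parity) is the bitwise exclusive-or of the heap sizes. The sketch below, with hypothetical function names, computes the Nim Sum, rebuilds the table, and finds a move to a W position whenever one exists.

```python
from functools import reduce
from operator import xor


def nim_sum(heaps):
    """Nim Sum of the heap sizes: bitwise exclusive-or, i.e. addition in the
    scale of 2 without carrying."""
    return reduce(xor, heaps, 0)


def move_to_w(heaps):
    """Return (heap index, new heap size) for a move to a W position
    (Nim Sum zero), or None if the position is already W."""
    total = nim_sum(heaps)
    if total == 0:
        return None
    for i, x in enumerate(heaps):
        target = x ^ total
        if target < x:          # a legal move must remove at least one match
            return i, target
    return None  # not reached: some heap always contains the top bit of total


# The 8 x 8 corner of the Nim Addition Table.
table = [[a ^ b for b in range(8)] for a in range(8)]
```

For heaps of 3, 4 and 5 matches the Nim Sum is 2, and `move_to_w([3, 4, 5])` reduces the first heap to 1, leaving the position (1, 4, 5) with Nim Sum zero.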


*There is a variation in which the last mover loses. 

Cf. C. L. Bouton, Annals of Mathematics (Harvard), 2nd Series, III (1901-2); W. W. R. Ball, Mathematical Recreations 
and Essays, Ch. I; Hardy and Wright, Theory of Numbers, etc. 

More generally, if a draw is impossible, a similar classification exists, provided the moves for the two players are identical
and depend only on the state of the game. First let each terminal state be classified W or L according as (by the rules of
the game) the player moving to it wins or loses. Owing to the fact that states of the game form a “partially ordered set”
it is now possible, with these end-conditions fixed, to work backwards as far as required using (1) at each stage. This
procedure is quite practicable, though laborious unless a lucky guess can be made. It follows again from (1) that a player
who once moves to a W state can continue to do so, and eventually win either by himself moving to a terminal W, or by
his opponent moving to a terminal L.

In Nim a state $X = (x_1, x_2, \ldots)$ may be regarded as reducible to a “product” $X_1 X_2 \ldots$ (in any order) of
irreducible states $X_1 = (x_1, 0, \ldots)$, $X_2 = (x_2, 0, \ldots)$; and at each move just one of the irreducible components
(factors) is changed (perhaps into a reducible factor in other similar games). For games of this restricted type, in which
the last mover wins, the work of classification can be made easier without recourse to guessing by means of a function
$\Omega(X)$ (= integer $\ge 0$) of the variable $X$ (= general state of the game) determined by the properties

(2)  any single move alters the value of $\Omega(X)$;
     if $0 \le \omega < \Omega(X)$, the value of $\Omega$ can be decreased to $\omega$ in one move;
     $\Omega(X) = 0$ when $X$ is terminal.

Taking the existence of $\Omega$ for granted, comparison of (2) with (1) shows that $X$ is a W-state if and only if $\Omega(X) = 0$.
It may also be deduced from (2) that

$$\Omega(XY\ldots) = \Omega(X) +_N \Omega(Y) +_N \cdots. \qquad (3)$$

This accounts for the advantage of working with $\Omega$, since all the winning positions will be known if the value of $\Omega$ is
calculated merely for irreducible states. In practice the values of $\Omega$ may be found by working back from the end of the
game, using (2) and (3).

The following game with piles of matches is an example: the move consists of taking any pile and dividing
it into two unequal parts. The game is one in which the last mover wins; and its states may be described by
sets of co-ordinates $x_1, x_2, \ldots$, giving the numbers in the heaps. We may use the same law of combination
$(x_1, x_2, \ldots) * (y_1, y_2, \ldots) = (x_1, x_2, \ldots, y_1, y_2, \ldots)$, obtained simply by putting the two sets of heaps side
by side, as before; and then, since the game is of the type just considered, we have as a special case of (3)

$$\Omega[(x_1, x_2, \ldots)] = \Omega[(x_1)] +_N \Omega[(x_2)] +_N \cdots. \qquad (4)$$

The irreducible states are those $(x)$ with only one co-ordinate (i.e. one heap of $x$ matches) and the law of moving is that
$(x)$ may be replaced by $(u, v)$ if

$$x = u + v, \quad u \ne v, \quad \text{all co-ordinates} \ge 1.$$

The values of $\Omega[(x)]$, obtained (with the help of (4)) according to (2) with this law are

$x$           = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, ..
$\Omega[(x)]$ = 0, 0, 1, 0, 2, 1, 0, 2, 1,  0,  2,  1,  3,  2,  1,  3,  2, ..

Owing to the curious repetitions, these values are quite easy to remember. Thus, as in Nim, a player using the theory can 
almost invariably win without visible calculation. 
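
Working back from the end of the game can be mechanised. One reading of properties (2) is that the value of a state is the smallest non-negative integer not reachable in one move; the sketch below uses that reading together with (4), where Nim Addition is again bitwise exclusive-or. The function names `mex` and `omega` are labels chosen for this illustration.

```python
from functools import lru_cache


def mex(values):
    """Smallest non-negative integer that does not occur in values."""
    m = 0
    while m in values:
        m += 1
    return m


@lru_cache(maxsize=None)
def omega(x):
    """Value of a single heap of x matches when a move splits a heap into
    two unequal, non-empty parts u < v with u + v = x."""
    reachable = set()
    for u in range(1, (x - 1) // 2 + 1):
        v = x - u
        # By (4), the value of the pair (u, v) is the Nim Sum of the parts.
        reachable.add(omega(u) ^ omega(v))
    return mex(reachable)


values = [omega(x) for x in range(1, 18)]
```

A position with several heaps is then a W-state exactly when the exclusive-or of the `omega` values of its heaps is zero.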


First printed in Eureka 2, 1939



Xanadu 

T. W. Körner


In Clarkson Road did Mathmo-man 
A stately book-store dome decree: 

Where doddering lecturers over-ran 
Through lemmas fathomless to man 
And made me miss my tea. 

So twice five miles of fertile ground 1 
With walls and towers were girded round: 

And there were courses on financial thought 
Where blossomed many an interest bearing fee 
And here were forests cheaply bought, 

The planners asked for lots of greenery. 

But O that deep ground floor 2 we wanted 
Beneath a green hill athwart a glassen cover! 

A largish void! which during term is granted 
To Part II students and where Part IIIs are decanted
Alike, the pure and physics lover. 

And from this chasm with ceaseless turmoil seething, 

As if some Natsci in short wool pants were breathing, 

A mighty fountain momently was forced; 

Some bloody fluids machine had burst!

Huge fragments vaulted like rebounded hail, 

The Clarkson Roaders wished us all in jail. 

And through the din the General Board at once and ever 
Flung out from time to time some thought, not clever, 
But which meandering with a mazy notion, 

Through Senate House talk forever ran, 

Then reached conclusions fathomless to man, 

And sank in tumult in some foolish motion: 

And 'mid this tumult Mathmos heard the boom
Of Needs Committees prophesying doom. 

The cost of such a dome of pleasure 
Floated mid too much and more; 

You sure need a lot of treasure 
For some pavilions plus a core. 

It was a miracle of rare device. 

It only leaked but once or twice. 


A chappy 3 with a dulcimer 
In a vision once I saw: 

A fund-raiser, or so he sayed, 

And on his dulcimer he played, 

Singing of donors by the score. 

Could I but revise within three 
Days, six months of lectures wrong. 

To such high Tripos place 'twould win me,
That with music loud and long, 

I would build that dome in air, 

That sunny dome! Those caves of ice! 4 
And all who heard should donors be, 

And all should cry, Let me, Let me, 

With this large cheque now set you free. 
Weave a circle round him thrice, 

And cast your eyes low on the ground, 

For he a donor now has found, 

And drunk the milk of Paradise. 

At this point the poet's research grant ran out.


1 Poetic licence intended to allow for further expansion.

2 Poetic licence again. The poet means a basement. 

3 The post was advertised in accordance with the provisions of the Sex Discrimina- 
tion Act but no suitably qualified damsel applied. 

4 Lecture Room 4. 
The Weird and Wonderful
Chemistry of Audioactive Decay 

J. H. Conway 


Introduction 

Suppose we start with a string of numbers (i.e. positive 
integers), say 

5 5 5 5 5 

We might describe this in words in the usual way as ‘five 
fives’ and write down the derived string 
5 5 

This we describe as ‘two fives’ so it yields the next de- 
rived string 

2 5 

which is ‘one two, one five’, giving 
1 2 1 5

namely ‘one one, one two, one one, one five’, or
1 1 1 2 1 1 1 5

and so on. What happens when an arbitrary string of 
positive integers is repeatedly derived like this? 

I note that more usually one is given a sequence such as 
55555 ; 55 ; 25 ; 1215 ; 11121115 ; 

and asked to guess the generating rule or the next term. 
The history of this problem is described elsewhere in 
this issue of Eureka. 

The numbers in our strings are usually single-digit 
ones, so we'll call them digits and usually cram them
together as we have just done. But occasionally we want 
to indicate the way the numbers in the string were ob- 
tained, and we can do this neatly by inserting commas 
recalling the commas and quotes in our verbal descrip- 
tions, thus 

5 5 5 5 5 
,5 5, 

,2 5, 

,1 2,1 5, 

,1 1,1 2,1 1,1 5, 

The insertion of these commas into a string or portion 
thereof is called parsing . 

We'll often denote repetitions by indices in the usual
way, so that the derivation rule is

$$a^\alpha b^\beta c^\gamma d^\delta \ldots \to \alpha a\, \beta b\, \gamma c\, \delta d \ldots$$

When we do this it is always to be understood that the
repetitions are collected maximally, so that we must
have

$$a \ne b,\ b \ne c,\ c \ne d,\ \ldots$$
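
The derivation rule can be sketched in a few lines, assuming a string is held as a list of positive integers; the name `derive` is simply a label for this illustration. Each maximal run of equal digits is replaced by its length followed by the digit.

```python
from itertools import groupby


def derive(string):
    """Replace each maximal run of a repeated digit by (run length, digit)."""
    out = []
    for digit, run in groupby(string):
        out.extend([len(list(run)), digit])
    return out


# The example from the introduction, starting from 'five fives'.
chain = [[5, 5, 5, 5, 5]]
for _ in range(4):
    chain.append(derive(chain[-1]))
```

Because `groupby` collects consecutive equal digits into one group, the runs are maximal, which is the convention that neighbouring letters in the rule must differ.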

Since what we write down is often only a chunk of the 
entire string (that is, a consecutive subsequence of its 
terms), we often use the square brackets “[” or “]” to
indicate that the apparent left or right end really is the
end. We also introduce the formal digits

0, as an index, to give an alternative way
of indicating the ends (see below),

$X$ for an arbitrary digit, possibly 0, and
$\ne n$ for any digit (maybe 0) other than $n$.

Thus $X^0 a^\alpha b^\beta c^\gamma$ means the same as $[a^\alpha b^\beta c^\gamma$,
$a^\alpha b^\beta c^\gamma X^0$ means the same as $a^\alpha b^\beta c^\gamma]$,
$a^\alpha b^\beta c^\gamma X^{\ne 0}$ means $a^\alpha b^\beta c^\gamma$ followed by another digit, and
$a^\alpha b^\beta c^\gamma (\ne 2)^{\ne 0}$ means that this digit is not a 2.

I’m afraid that this heap of conventions makes it quite 
hard to check the proofs, since they cover many more 
cases than one naively expects. To separate these cases 
would make this article very long and tedious, and the 
reader who really wants to check all the details is ad- 
vised first to spend some time practising the derivation 
process. Note that when we write 

$$L \to L' \to L'' \to \ldots$$

we mean just that every string of type L derives to one of 
type L', every string of type L' derives to one of type L", 
and so on. So when in our proof of the Ending Theorem 
we have 

$$n^n] \xrightarrow{(n \ne 2)} n^{\ne n}] \to n^1]$$

the fact that the left arrow is asserted only when $n \ne 2$
does not excuse us from checking the right arrow for
$n = 2$. (But, since $n > 1$ is enforced at that stage in the
proof, we needn't check either of them for $n = 1$.)

By applying the derivation process $n$ times to a string $L$
we obtain what we call its $n$th descendant, $L_n$. The string
itself is counted among its descendants, as the 0th.

Sometimes, a string factors as the product $LR$ of two
strings $L$ and $R$ whose descendants never interfere with
each other, in the sense that $(LR)_n = L_n R_n$ for all $n$. In
this case, we say that $LR$ splits as $L.R$ (dots in strings
will always have this meaning). It is plain that this happens
just when ($L$ or $R$ is empty or) the last digit of $L_n$
always differs from the first one of $R_n$. Can you find a
simple criterion for this to happen? (When you give up,
you'll find the answer in our Splitting Theorem.)
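
The defining condition for a split can be tested empirically for a finite number of days. The sketch below is such a check and not a proof; `derive` and `splits_up_to` are hypothetical helper names. The example string 1113213211 is the one that appears later in this article split as 11132.13211.

```python
from itertools import groupby


def derive(string):
    """Replace each maximal run of a repeated digit by (run length, digit)."""
    out = []
    for digit, run in groupby(string):
        out.extend([len(list(run)), digit])
    return out


def splits_up_to(left, right, days):
    """Check (LR)_n == L_n R_n for n = 0, 1, ..., days.
    Passing this finite check is evidence of a split, not a proof."""
    whole = left + right
    for _ in range(days + 1):
        if whole != left + right:
            return False
        whole, left, right = derive(whole), derive(left), derive(right)
    return True


good = splits_up_to([1, 1, 1, 3, 2], [1, 3, 2, 1, 1], 15)
bad = splits_up_to([1], [1], 15)
```

The pair 1 and 1 fails at once: the two digits merge into a single run, so the descendants of the whole string are not the concatenation of the descendants of the parts.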

Obviously, we call a string with no non-trivial splittings 
an atom, or element . Then every string is the split prod- 
uct, or compound , of a certain number of elements, 
which we call the elements it involves . There are infinite- 
ly many distinct elements, but most of them only arise 
from specially chosen starting strings. However, there 
are some very interesting elements that are involved in 
the descendants of every string except the boring ones 
[ ] and [22]. Can you guess how many of them there 
are? (Hint: we have given them the names Hydrogen, 
Helium, Lithium,..., Uranium.) 


It's also true (but ASTONISHINGLY hard to prove) that
every string eventually decays into a compound of these
elements, together with perhaps a few others (namely
isotopes of Plutonium and Neptunium). Moreover, all
strings except the two boring ones increase in length
exponentially at the same constant rate. (This rate is
roughly 1.30357726903: it can be precisely defined as
the largest root of a certain algebraic equation of degree
71.) Also, the relative abundances of the elements settle
down to fixed numbers (zero for Neptunium and Plutonium).
Thus, of every million atoms about 91790 on
average will be of Hydrogen, the commonest element,
while about 27 will be of Arsenic, the rarest one.
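
The growth rate can be observed numerically by deriving a string repeatedly and comparing the lengths of successive descendants. The sketch below starts from the single digit 1 and takes 40 derivations, both arbitrary choices for this illustration; the ratio of successive lengths should then lie close to the constant quoted above, although it fluctuates slightly from day to day.

```python
from itertools import groupby


def derive(string):
    """Replace each maximal run of a repeated digit by (run length, digit)."""
    out = []
    for digit, run in groupby(string):
        out.extend([len(list(run)), digit])
    return out


string = [1]
lengths = [len(string)]
for _ in range(40):
    string = derive(string)
    lengths.append(len(string))

# Ratio of the lengths of the last two descendants computed.
ratio = lengths[-1] / lengths[-2]
```

Starting from a different non-boring string changes the early lengths but, according to the statement above, not the limiting ratio.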

The Periodic Table 


abundance      n   E_n  E_n inside the derivate of E_{n+1}

102.56285249   92  U    3
9883.5986392   91  Pa   13
7581.9047125   90  Th   1113
6926.9352045   89  Ac   3113
5313.7894999   88  Ra   132113
4076.3134078   87  Fr   1113122113
3127.0209328   86  Rn   311311222113
2398.7998311   85  At   Ho.1322113
1840.1669683   84  Po   1113222113
1411.6286100   83  Bi   3113322113
1082.8883285   82  Pb   Pm.123222113
830.70513293   81  Tl   111213322113
637.25039755   80  Hg   31121123222113
488.84742982   79  Au   132112211213322113
375.00456738   78  Pt   111312212221121123222113
287.67344775   77  Ir   3113112211322112211213322113
220.68001229   76  Os   1321132122211322212221121123222113
169.28801808   75  Re   111312211312113221133211322112211213322113
315.56655252   74  W    Ge.Ca.312211322212221121123222113
242.07736666   73  Ta   13112221133211322112211213322113
2669.0970363   72  Hf   11132.Pa.H.Ca.W
2047.5173200   71  Lu   311312
1570.6911808   70  Yb   1321131112
1204.9083841   69  Tm   11131221133112
1098.5955997   68  Er   311311222.Ca.Co
47987.529438   67  Ho   1321132.Pm
36812.186418   66  Dy   111312211312
28239.358949   65  Tb   3113112221131112
21662.972821   64  Gd   Ho.13221133112
20085.668709   63  Eu   1113222.Ca.Co
15408.115182   62  Sm   311332
29820.456167   61  Pm   132.Ca.Zn
22875.863883   60  Nd   111312
17548.529287   59  Pr   31131112
13461.825166   58  Ce   1321133112
10326.833312   57  La   11131.H.Ca.Co
7921.9188284   56  Ba   311311
6077.0611889   55  Cs   13211321
4661.8342720   54  Xe   11131221131211
3576.1856107   53  I    311311222113111221
2743.3629718   52  Te   Ho.1322113312211
2104.4881933   51  Sb   Eu.Ca.3112221
1614.3946687   50  Sn   Pm.13211
1238.4341972   49  In   11131221
950.02745646   48  Cd   3113112211
728.78492056   47  Ag   132113212221


You should get to know the common elements, as enumerated
in our Periodic Table. The abundance (in atoms
per million) is given first, followed by the atomic
number and symbol as in ordinary chemistry. The actual
digit-string defining the element is the numerical part
of the remainder of the entry, which, when read in full,
gives the derivate of the element of next highest atomic
number, split into atoms. Thus, for example, the last line
of the Periodic Table tells us that Hydrogen (H) is our
name for the digit-string 22, and that the next higher
element, Helium (He), derives to the compound
Hf.Pa.H.Ca.Li
which we might call

“Hafnium-Protactinium-Hydrogen-Calcium Lithide”!

Not everything is in the Periodic Table! For instance,
the single digit string “1” isn't. But watch:

1 

11 

21 

1211 

111221 

312211 

13112221 

11132.13211 = Hf.Sn

After a few moves it has become Hafnium Stannide! This
is an instance of our “Cosmological Theorem”, which
asserts that the exotic elements (such as “1”) all disappear
soon after the Big Bang.


abundance      n   E_n  E_n inside the derivate of E_{n+1}

559.06537946   46  Pd   111312211312113211
428.87015041   45  Rh   311311222113111221131221
328.99480576   44  Ru   Ho.132211331222113112211
386.07704943   43  Tc   Eu.Ca.311322113212221
296.16736852   42  Mo   13211322211312113211
227.19586752   41  Nb   1113122113322113111221131221
174.28645997   40  Zr   Er.12322211331222113112211
133.69860315   39  Y    1112133.H.Ca.Tc
102.56285249   38  Sr   3112112.U
78.678000089   37  Rb   1321122112
60.355455682   36  Kr   11131221222112
46.299868152   35  Br   3113112211322112
35.517547944   34  Se   13211321222113222112
27.246216076   33  As   11131221131211322113322112
1887.4372276   32  Ge   31131122211311122113222.Na
1447.8905642   31  Ga   Ho.13221133122211332
23571.391336   30  Zn   Eu.Ca.Ac.H.Ca.312
18082.082203   29  Cu   131112
13871.124200   28  Ni   11133112
45645.877256   27  Co   Zn.32112
35015.858546   26  Fe   13122112
26861.360180   25  Mn   111311222112
20605.882611   24  Cr   31132.Si
15807.181592   23  V    13211312
12126.002783   22  Ti   11131221131112
9302.0974443   21  Sc   3113112221133112
56072.543129   20  Ca   Ho.Pa.H.12.Co
43014.360913   19  K    1112
32997.170122   18  Ar   3112
25312.784218   17  Cl   132112
19417.939250   16  S    1113122112
14895.886658   15  P    311311222112
32032.812960   14  Si   Ho.1322112
24573.006696   13  Al   1113222112
18850.441228   12  Mg   3113322112
14481.448773   11  Na   Pm.123222112
11109.006821   10  Ne   111213322112
8521.9396539   9   F    31121123222112
6537.3490750   8   O    132112211213322112
5014.9302464   7   N    111312212221121123222112
3847.0525419   6   C    3113112211322112211213322112
2951.1503716   5   B    1321132122211322212221121123222112
2263.8860325   4   Be   111312211312113221133211322112211213322112
4220.0665982   3   Li   Ge.Ca.312211322212221121123222112
3237.2968588   2   He   13112221133211322112211213322112
91790.383216   1   H    Hf.Pa.22.Ca.Li
The Theory

We start with some easy theorems that restrict the possible strings after the first few moves. Any chunk of a string that
has lasted at least $n$ moves will be called an $n$-day-old string.

The One-Day Theorem: Chunks of types

$,a\,x,\ b\,x,$ ; $x^{4 \text{ or more}}$ ; and $x^3 y^3$

don't happen in day-old lists. (Note that the first one has a given parsing.)

Proof: The first possibility comes from $x^a x^b$, which, however, should have been written $x^{a+b}$, in the previous day's string.
The other two, however parsed, imply cases of the first.

The Two-Day Theorem: No digit 4 or more can be born on or after the second day. Also, a chunk $3x3$ (in particular $3^3$)
can't appear in any 2-day-old list.

Proof: The first possibility comes from a chunk $x^{4 \text{ or more}}$, while the second, which we now know must parse $,3\,x,\ 3\,y,$
can only come from a chunk $x^3 y^3$ of the previous day's string.

When tracking particular strings later, we'll use these facts without explicit mention.

The Starting Theorem: Let R be any chunk of a 2-day-old string, considered as a string in its own right. Then the starts 
of its descendants ultimately cycle in one of the ways 

[ ]   or   $[1^1X^1 \to [1^3 \to [3^1X^{\neq 3} \to [1^1X^1$ 

or 

$[2^2]$   or   $[2^21^1X^1 \to [2^21^3 \to [2^23^1X^{\neq 3} \to [2^21^1X^1$ 

If R is not already in such a cycle, at least three distinct digits appear as initial digits of its descendants. 
Proof: If R is non-empty and doesn't start with $2^2$, then it either starts with a 1 and is of one of the types 

[iix 0 or 1 or [l 1 (2 2 or 3 or 3 2 ) or [ 1 2 X 3 or ^ 1 or [l 3 
or starts with a 2 and is one of the types 


or starts with a 3 and is of one of the types 


[2 3 X 2 or * 2 or [2 3 


or starts with some n > 3 and has form [ n 1 . 


[ 3 : X 3 or * 3 or [ 3 2 X 3 or * 3 


It is therefore visible in 

[ 1 3 2 3 [ 3 2 X 3 [l 1 ] [ 3 2 X* 3 [n 1 [ 2 3 

N X X X \ X 

[3 3 X 3 [ 1‘3 2 [1 2 X‘ [2 3 X 2 - [ 1 3 2 2 _ [1 2 X« [2 3 X« 4 [l^ 1 - [l 3 i [3 3 X« 

which establishes the desired results for it. 

This proves the theorem except for strings of type $[2^2R'$ all of whose descendants start with $2^2$. This happens only if no 
descendant of R' starts with a 2, and so we can complete the proof by applying the results we've just found to R'. 


The Splitting Theorem: A 2-day-old string LR splits as L . R just if one of L and R is empty or L and R are of the types 
shown in one of: 

L          R 

$n]$       $[m$   $(n \ge 4,\ m \le 3)$ 

$2]$       $[1^1X^1$ or $[1^3$ or $[3^1X^{\neq 3}$ or $[n^1$   $(n \ge 4)$ 

$\neq 2]$   $[2^21^1X^1$ or $[2^21^3$ or $[2^23^1X^{\neq 3}$ or $[2^2n^{(0 \text{ or } 1)}$   $(n \ge 4)$ 


Proof: This follows immediately from the Starting Theorem applied to R and the obvious fact that the last digit of L is 
constant. 


Now we investigate the evolution of the end of the string! 











The Ending Theorem: The end of a string ultimately cycles in one of the ways: 

2.311322113212221]   →   2.13211322211312113211] 
        ↑                           ↓ 
2.12322211331222113112211]   ←   2.1113122113322113111221131221] 

or 

2.31221132221222112112322211n] 
        ↑ ↓        (n > 1) 
2.1311222113321132211221121332211n] 

or  $2^2$] 

[Note: our splitting theorem shows that these strings actually do split at the dots, although we don't use this.] 
Proof: A string with last digit 1 must end in one of the ways visible in 

V- 3 ] («^l 1 ] - (^2) x l 2 ] 2 1 or - 4 1 X ] _ 2 1 or - 3 1 2 ] 2 2 1 : ] 

and its subsequent evolution is followed on the right hand side of Figure 1. 

A string with last digit n > 1 must end n n ] or n* n ] and so evolves via 

n=2 

flL 

n n ] n 


2 2 1 2 ] -> 2 3 ! 1 ] 


* n ] - n 1 ] - ln] _ lln] - (^l)lln] 
and the last string here is the first or second on the left of Figure 1. 

Left-hand column (last digit n > 1), read downwards: 

211n]  →  2211n] 
(≠2)2211n] 
(≠2)22211n] 
32211n] 
322211n] 
(≠3)332211n] 
2322211n] 
21332211n] 
2112322211n] 
221121332211n] 
22112112322211n] 
2211221121332211n] 
221222112112322211n] 
21132211221121332211n] 
221132221222112112322211n] 
22113321132211221121332211n] 
22.12.31221132221222112112322211n] 
2.1311222113321132211221121332211n] 
2.11132.13.22.12.31221132221222112112322211n] 
(period 2: the last two endings alternate) 

Right-hand column (last digit 1), read downwards: 

(≠2)2221] 
3211] 
31221] 
3112211] 
3212221] 
312113211] 
3111221131221] 
(≠3)331222113112211] 
2.311322113212221] 
2.13211322211312113211] 
2.1113122113322113111221131221] 
2.311311222.12322211331222113112211] 
2.1112133.22.12.311322113212221] 
(period 4: the last ending returns to 2.311322113212221]) 

Figure 1: The evolution of endings other than $2^2$] 

This figure proves the theorem except for the trivial case 2 2 ]. (When any of these strings contains a dot, its subsequent 
development is only followed from the digit just prior to the rightmost dot.) 

We are now ready for our first major result: 

The Chemical Theorem: 

a) The descendants of any of the 92 elements in our Periodic Table are compounds of those elements. 

b) All sufficiently late descendants of any of these elements other than Hydrogen involve all of the 92 elements simultaneously. 

c) The descendants of any string other than [ ] or [22] also ultimately involve all of those 92 elements simultaneously. 

d) These 92 elements are precisely the common elements as defined in the introduction. 

Proof: 

a) follows instantly from the form in which we have presented the Periodic Table. 


b) It also follows that if the element $E_n$ of atomic number n appears at some time t, then for any m < n, all of the elements 
on the $E_m$ line of the table will appear at the later time t + n - m. In particular, 

$E_n$ at $t$  $\Rightarrow$  Hf & Li at $t+n-1$  (if $n \ge 2$), 

Hf & Li at $t$  $\Rightarrow$  Hf & Li at $t+2$ & $t+71$, 

Hf at $t$  $\Rightarrow$  Sr & U at $t+72-38$, 

U at $t$  $\Rightarrow$  $E_n$ at $t+92-n$. 

From these we successively deduce that if any of these 92 elements other than Hydrogen is involved at some time $t_0$, Hafnium 
and Lithium will simultaneously be involved at some strictly later time < $t_0$ + 100, and then both will exist at all 
times > $t_0$ + 200, Uranium at all times > $t_0$ + 300, and every other one of these 92 elements at all times > $t_0$ + 400. 

In other words, once you can fool some of the elements into appearing some of the time, then soon you'll fool some of 
them all of the time, and ultimately you'll be fooling all of the elements all of the time! 

c) If L is not of form L'$2^2$], this now follows from the observation that Calcium (digit-string 12) is a descendant of L, 
since it appears in both the bottom lines of Figure 1. Otherwise we can replace L by L', which does not end in a 2. 

d) Follows from a), b), c) and the definition of the common elements. 

Now we’ll call an arbitrary string common just if it’s a compound of common atoms. 

The Arithmetical Theorem: 

a) The lengths of all common strings other than boring old [ ] and [22] increase exponentially at the same rate λ > 1. 

b) The relative abundances of the elements in such strings tend to certain fixed values, all strictly positive. 

Notes: Since each common element has at least 1 and at most 42 digits we can afford to measure the lengths by either 
digits or atoms - we prefer to use atoms. The numerical value of λ is 1.30357726903; the abundances are tabulated in 
the Periodic Table. 
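
The growth rate can be watched numerically. The following is a rough sketch, not part of the original argument: it assumes the usual "look and say" derivation rule, starts from the string "1" (an arbitrary choice), and measures lengths in digits rather than atoms. The helper names are ours.

```python
from itertools import groupby

def derive(s):
    """Return the derivate of s: each run of k copies of a digit d becomes 'kd'."""
    return "".join(str(len(list(run))) + digit for digit, run in groupby(s))

def length_ratios(seed, steps):
    """Ratios len(next) / len(current) along `steps` successive derivations of seed."""
    ratios, s = [], seed
    for _ in range(steps):
        t = derive(s)
        ratios.append(len(t) / len(s))
        s = t
    return ratios

ratios = length_ratios("1", 45)
print(ratios[-5:])   # the later ratios settle close to 1.3036
```

Convergence is slow, so after 45 steps the ratio agrees with the quoted value only to about three decimal places.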

Proof: Let v be the 92-component vector whose (i)-entry is the number of atoms of atomic number i in some such 
string. Then at each derivation step, v is multiplied by the matrix M whose (i, j)-entry is the number of times $E_j$ is 
involved in the derivate of $E_i$. Now our Chemical Theorem shows that some power of M has strictly positive (i, j)-entries 
for all i ≠ 1 (the (1, j)-entry will be 0 for j ≠ 1, 1 for j = 1, since every descendant of a single atom of Hydrogen is 
another such). 

Let λ be an eigenvalue of M with the largest possible modulus, and $v_0$ a corresponding eigenvector. Then the non-zero 
entries of $v_0M^n$ are proportional to $\lambda^n$, while the entries in the successive images of all other vectors grow at at most this 
rate. Since the 92 coordinate vectors (which we'll call H, He, ..., U in the obvious way) span the space, at least one of them 
must increase at rate λ. 

On the other hand, our Chemical Theorem shows that the descendants of each of H, He,..., U increase as fast as any of 
them, and that this is at some rate > 1, while H is a fixed vector (rate 1). These remarks establish our Theorem. 

[We have essentially proved the Frobenius-Perron Theorem, that the dominant eigenvalue of a matrix with positive entries 
is positive and occurs just once, but I didn't want to frighten you with those long names.] 

The Transuranic Elements 

For each number n ≥ 4, we define two particular atoms: 

an isotope of Plutonium (Pu) : 31221132221222112112322211n 
an isotope of Neptunium (Np) : 1311222113321132211221121332211n 

For n = 2 these would be Lithium (Li) and Helium (He); for n = 3 they would be Tungsten (W) and Tantalum (Ta), while for 
n ≥ 4 they are called the transuranic elements. We won't bother to specify the number n in our notation. 

We can enlarge our 92-dimensional vector space by adding any number of new pairs of coordinate vectors Pu, Np corresponding 
to pairs of transuranic elements. 

Our proof of the Ending Theorem shows that every digit 4 or more ultimately lands up as the last digit in one of the 
appropriate pair of transuranic elements, and (see the bottom left of Figure 1) that we have the decomposition 

Pu → Np → Hf.Pa.H.Ca.Pu. 
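
This decomposition is easy to confirm by machine. The sketch below is illustrative only: it assumes the "look and say" derivation rule, takes the digit strings of Hf, Pa, H and Ca as they appear in Figure 1 (11132, 13, 22, 12), and tries the single digits n = 4, ..., 9. The names `derive`, `PU_STEM`, `NP_STEM` and `check` are our own.

```python
from itertools import groupby

def derive(s):
    """Return the derivate of s: each run of k copies of a digit d becomes 'kd'."""
    return "".join(str(len(list(run))) + digit for digit, run in groupby(s))

PU_STEM = "31221132221222112112322211"        # Pu without its final digit n
NP_STEM = "1311222113321132211221121332211"   # Np without its final digit n
Hf, Pa, H, Ca = "11132", "13", "22", "12"

def check(n):
    """True if Pu decays to Np and Np decays to Hf.Pa.H.Ca.Pu for final digit n."""
    pu, np_ = PU_STEM + n, NP_STEM + n
    return derive(pu) == np_ and derive(np_) == Hf + Pa + H + Ca + pu

print(all(check(n) for n in "456789"))
```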

Now Pu ± Np is an eigenvector of eigenvalue ±1 modulo the subspace corresponding to the common elements, since 
Pu ↔ Np modulo that space. Because these eigenvalues are strictly less than λ in modulus, the relative abundances of the 
transuranic elements tend to 0. 

So far, I can proudly say that this magnificent theory is essentially all my own work. However, the next theorem, the 
finest achievement so far in Audioactive Chemistry, is the result of the combined labours of three brilliant investigators. 
The Cosmological Theorem: 

Any string decays into a compound of common and transuranic elements after a bounded number of derivation steps. 
As a consequence, every string other than the two boring ones increases at the magic rate λ, and the relative abundances 
of the atoms in its descendants approach the values we have already described. 

Proof of the Cosmological Theorem would fill the rest of Eureka! Richard Parker and I found a proof over a period of 
about a month of very intensive work (or, rather, play!). We first produced a very subtle and complicated argument which 
(almost) reduced the problem to tracking a few hundred cases, and then handled these on dozens of sheets of paper (now 
lost). Mike Guy found a simpler proof that used tracking and backtracking in roughly equal proportions. Guy's proof still 
filled lots of pages (almost all lost), but had the advantage that it found the longest-lived of the exotic elements, namely 
the isotopes of Methuselum (2233322211n; see Figure 2). Can you find a proof in only a few pages? Please! 

The degree of λ 

Plainly λ is an algebraic number of degree at most 92. We first reduce this bound to 71 by exhibiting a 21-dimensional 
invariant subspace on which the eigenvalues of M are 0 and ±1. To do this, we define the vectors 

$v_1 = \mathrm{H},\quad v_2 = \mathrm{He} - \mathrm{Ta},\quad v_3 = \mathrm{Li} - \mathrm{W},\quad \ldots,\quad v_{20} = \mathrm{Ca} - \mathrm{Pa},$ 

or, in atomic number notation 

$v_1 = E_1,\quad v_2 = E_2 - E_{73},\quad v_3 = E_3 - E_{74},\quad \ldots,\quad v_{20} = E_{20} - E_{91},$ 

and also define 

$v_{21} = \{\mathrm{Sc} + \mathrm{Sm} - \mathrm{H} - \mathrm{Ni} - \mathrm{Er} - 3\,\mathrm{U}\}/2,$ 

then observe that 

$v_{21} \to v_{20} \to v_{19} \to \cdots \to v_4 \to v_3 \leftrightarrow v_2, \qquad v_1 \to v_1.$ 

An alternative base for this space consists of the eigenvectors $v_1$ and $v_3 \pm v_2$ of M, with the respective 
eigenvalues 1 and ±1, 


together with the following Jordan block of size 18 for the eigenvalue 0 


[This shows that M is one of those “infinitely rare” matrices that cannot be diagonalised. Don't expect to follow these 
remarks unless you've understood more of linear algebra than I fear most of your colleagues have!] 


2233322211n (n > 1) 

223332211n 

223322211n 

222332211n 

322322211n 

13221332211n 

111322112322211n 

31132221121332211n 

132113322112112322211n 

La.H.12322211221121332211n 

1112133221222112112322211n 

Sr.3221132211221121332211n 

132221132221222112112322211n 

1113322113321132211221121332211n 

3123222 .Ca.(Li or W or Pu) 

1311121332 

11133112112.Zn 

Zn.321122112 

131221222112 

1113112211322112 

311321222113222112 

1321131211322113322112 

111312211311122113222.Na 

3113112221133122211332 

Ho.Pa.H.Ca.Ac.H.Ca.Zn 


Figure 2: The descendants of Methuselum 
Richard Parker and I have recently proved that the residual 71st degree equation for λ is irreducible, even when it is read 
modulo 5. We use the fact that the numbers in a finite field of order q all satisfy $x^q = x$ (since the non-zero ones form a 
group of order q - 1, and so satisfy $x^{q-1} = 1$). 

Working always modulo 5, we used a computer to evaluate the sequence of matrices 

$M_0 = M,\quad M_1 = M_0^5,\quad M_2 = M_1^5,\quad M_3 = M_2^5,\quad M_4 = M_3^5,\ \ldots$ 

and to verify that the nullity (modulo 5) of $M_{n+2} - M_2$ was 21 for 1 ≤ n ≤ 70, but 92 for n = 71. Note that the 21 vectors of 
the above ‘alternative base’ are eigenvectors of $M_2$ whose eigenvalues (modulo 5) lie in the field of order 5. 
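
The shape of this computation can be tried on a toy example. The sketch below does not use the real 92 by 92 matrix M, which is not reproduced here; as a stand-in (our assumption) it takes the 2 by 2 companion matrix of $x^2 - 2$, which is irreducible modulo 5, and all function names are our own. For this stand-in the nullity of $M_{n+2} - M_2$ should be 0 for n = 1 and jump to the full dimension 2 at n = 2, mirroring the jump from 21 to 92 at n = 71.

```python
P = 5  # all arithmetic is modulo 5

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) % P for j in range(n)]
            for i in range(n)]

def fifth_power(a):
    a2 = mat_mul(a, a)
    a4 = mat_mul(a2, a2)
    return mat_mul(a4, a)

def mat_sub(a, b):
    return [[(x - y) % P for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def nullity(a):
    """Dimension of the null space of a over the field of order 5 (Gaussian elimination)."""
    rows = [[x % P for x in r] for r in a]
    n_cols, rank = len(rows[0]), 0
    for col in range(n_cols):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], P - 2, P)   # inverse, since x^(P-1) = 1 for x != 0
        rows[rank] = [(x * inv) % P for x in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(x - factor * y) % P for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return n_cols - rank

# The field of order 5 itself: every element satisfies x^5 = x.
print(all(pow(x, 5, P) == x for x in range(P)))

# Toy stand-in for M (an assumption, not the audioactive matrix).
M = [[0, 2], [1, 0]]
seq = [M]
for _ in range(4):
    seq.append(fifth_power(seq[-1]))   # seq[n] plays the role of M_n

nullities = [nullity(mat_sub(seq[n + 2], seq[2])) for n in (1, 2)]
print(nullities)
```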

If the 71st degree equation were reducible modulo 5, then $M_2$ would have an eigenvector linearly independent of these 
with eigenvalue lying in some extension field of order $q = 5^n$ (1 ≤ n ≤ 70). But then the eigenvalues φ of these 22 
eigenvectors would all satisfy $\varphi^q = \varphi$, and the 22 eigenvectors would be null-vectors for 

$(M_2)^q - M_2 = M_{n+2} - M_2,$ 

contradicting our computer calculations. 

It is rather nice that we were able to do this without being able to write down the polynomial. However, Professor Oliver 
Atkin of Chicago has since kindly calculated the polynomial explicitly, and has also evaluated its largest root λ as 

1.3035772690342963912570991121525498 

approximately. The polynomial is 

$x^{71} - x^{69} - 2x^{68} - x^{67} + 2x^{66} + 2x^{65} + x^{64} - x^{63} - x^{62} - x^{61}$ 

$- x^{60} - x^{59} + 2x^{58} + 5x^{57} + 3x^{56} - 2x^{55} - 10x^{54} - 3x^{53} - 2x^{52} + 6x^{51}$ 

$+ 6x^{50} + x^{49} + 9x^{48} - 3x^{47} - 7x^{46} - 8x^{45} - 8x^{44} + 10x^{43} + 6x^{42} + 8x^{41}$ 

$- 5x^{40} - 12x^{39} + 7x^{38} - 7x^{37} + 7x^{36} + x^{35} - 3x^{34} + 10x^{33} + x^{32} - 6x^{31}$ 

$- 2x^{30} - 10x^{29} - 3x^{28} + 2x^{27} + 9x^{26} - 3x^{25} + 14x^{24} - 8x^{23} - 7x^{21}$ 

$+ 9x^{20} + 3x^{19} - 4x^{18} - 10x^{17} - 7x^{16} + 12x^{15} + 7x^{14} + 2x^{13} - 12x^{12} - 4x^{11}$ 

$- 2x^{10} + 5x^{9} + x^{7} - 7x^{6} + 7x^{5} - 4x^{4} + 12x^{3} - 6x^{2} + 3x - 6$ 


First printed in Eureka 46, 1986 
Soap Bubble 

A soap bubble is an extremely thin film of 
soapy water enclosing air that forms a hol- 
low sphere with an iridescent surface. They 
will assume the shape of least surface area 
possible containing a given volume. 
Isometric versions of 
Ramsey theorem 

Igor Protasov 

Professor of Mathematics, Kiev University 


For any natural numbers n, r, there exists a natural number m, such that for any r-coloring of edges of the complete graph $K_m$, there is a monochrome copy of $K_n$. 

This elegant statement is a graph version of the Ramsey theorem, one of the milestones of Ramsey Theory. For a look at the history (with exposition of the original paper of Frank Ramsey) and foundations of this branch of Combinatorics, see for example [2]. 

Clearly, $K_n$ contains an isomorphic copy of any graph with ≤ n vertices, so for every finite graph Γ and a natural number r, there exists a natural number m such that, for any r-coloring of edges of $K_m$, there is a monochrome copy Γ' of Γ. But this Γ', as a rule, lies in $K_m$ very non-isometrically with respect to Γ. We need some definitions to explain this remark. 

All graphs are supposed to be finite and connected. Every graph Γ with the set of vertices V(Γ) and the set of edges E(Γ) (each edge is an unordered pair {u, v} of two distinct vertices) can be considered as the metric space $(V(\Gamma), d_\Gamma)$ with the path metric $d_\Gamma$ defined by the rule: $d_\Gamma(u, v)$ is the length (by edges) of the shortest path (called geodesic) between u and v. Given two graphs Γ and G, a mapping $f : V(\Gamma) \to V(G)$ is an isometric embedding if, for any $u, v \in V(\Gamma)$, $d_\Gamma(u, v) = d_G(f(u), f(v))$. Thus, Γ is isometrically embedded into $K_m$ if and only if Γ is complete. 

Now we consider the interval graph $I_n$, 

$V(I_n) = \{0, 1, \ldots, n\}$, 

$E(I_n) = \{\{i, i+1\} : i \in \{0, \ldots, n-1\}\}$, 

take a natural number r, and ask the following question. 

Question 1: Does there exist a graph G such that, for any r-coloring of E(G), there is an isometric monochrome copy of $I_n$? In other words, does G have a monochrome geodesic path of length n? 

We ask also the directed version of Question 1. 

Question 2: Does there exist a graph G such that, for any orientation of E(G), there is a directed geodesic path of length n? 

After some definitions, notations and constructions, we give positive answers to both Question 1 and Question 2, and moreover we show that G can be taken to be a Cartesian product of n complete graphs. 

We recall that the Cartesian product $G = G_1 \times \cdots \times G_n$ of graphs $G_1, \ldots, G_n$ is a graph with the set of vertices $V(G) = V(G_1) \times \cdots \times V(G_n)$ and the set of edges 

$E(G) = \bigcup_{i \le n} E_i(G)$ 

defined by the rule: for $u = (u_1, \ldots, u_n)$ and $v = (v_1, \ldots, v_n)$, 

$\{u, v\} \in E_i(G) \iff u_iv_i \in E(G_i),\ u_k = v_k,\ k \in \{1, \ldots, i-1, i+1, \ldots, n\}.$ 

By $K_n^m$, we denote the Cartesian product of m copies of $K_n$. 

To get $\Gamma \times K_n$, we take n copies of Γ and connect by edges all pairs of corresponding vertices in these copies. 

For a natural number n, we say that a subgraph S of G is an n-star centered at the vertex o if S is the union of geodesic intervals $R_1, \ldots, R_k$ with common end o such that the subgraphs $R_1 \setminus \{o\}, \ldots, R_k \setminus \{o\}$ are pairwise disjoint by vertices and $l(R_1) + \cdots + l(R_k) = n$, $l(R_i) > 0$, $i \in \{1, \ldots, k\}$, where $l(R_i)$ is the length of $R_i$. Each $R_i$ is called a ray of S. If each edge of E(G) is oriented, we say that S is directed if each edge $\{v_i, v_{i+1}\}$ in its ray $v_0, \ldots, v_t$, $v_0 = o$, is oriented as $v_iv_{i+1}$. 

We recall that diam(G) is the length of the longest geodesic path in G, and introduce directed and two chromatic diameters of G by: 

• $\overrightarrow{\mathrm{diam}}(G)$ := maximal p such that, in each orientation of E(G), there is a directed geodesic path of length p; 

• $r\text{-}\mathrm{diam}_E(G)$ := maximal p such that, in each r-coloring of E(G), there is an edge-monochrome geodesic path of length p; 

• $r\text{-}\mathrm{diam}_V(G)$ := maximal p such that, in each r-coloring of V(G), there is a vertex-monochrome geodesic path of length p. 

Now everything is ready to supply the promised answer to Question 1 and Question 2. Just observe that diam(G) = n for each Cartesian product of any n complete graphs with at least two vertices in each. 
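
The observation about the ordinary diameter can be checked by brute force on small products. The following is a minimal sketch using breadth-first search; the representation of vertices as tuples and the function names are our own choices, not part of the paper.

```python
from collections import deque
from itertools import product

def neighbours(v, sizes):
    """Vertices adjacent to v in K_{sizes[0]} x ... x K_{sizes[-1]}:
    exactly one coordinate is changed."""
    for i, size in enumerate(sizes):
        for a in range(size):
            if a != v[i]:
                yield v[:i] + (a,) + v[i + 1:]

def distances_from(source, sizes):
    """Path-metric distances from source, by breadth-first search."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in neighbours(v, sizes):
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist

def diameter(sizes):
    vertices = product(*(range(s) for s in sizes))
    return max(max(distances_from(v, sizes).values()) for v in vertices)

print(diameter((2, 3, 4)))      # a product of 3 complete graphs
print(diameter((2, 2, 2, 2)))   # a product of 4 complete graphs
```

In such a product the distance between two vertices is the number of coordinates in which they differ, which is why the diameter equals the number of factors.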

Theorem 1: For any natural numbers n, r, there exists a Cartesian product G of n complete graphs such that, for any r-coloring and orientation of E(G), there is a directed isometrically embedded n-star S with monochrome rays of pairwise distinct colors. In particular 

$\overrightarrow{\mathrm{diam}}(G) = n$ and $r\text{-}\mathrm{diam}_E(G) \ge n/r$. 

Proof: We fix r and proceed by induction on n. For n = 1, we take $G = K_2$. 

We assume that a graph G satisfies the conclusion for some n, put $m = (2r)^{|E(G)|} + 1$, and consider the Cartesian product $H = G \times K_m$. 

We take an arbitrary orientation O of E(H) and coloring $\chi : E(H) \to \{1, \ldots, r\}$. By the choice of m, there are $c, d \in V(K_m)$ such that the restrictions of χ and O on $G \times \{c\}$, $G \times \{d\}$ coincide. 

By the inductive assumption, there is an n-star S in G centered at o such that the n-star $S \times \{c\}$ is isometrically embedded into $G \times \{c\}$, directed and has monochrome rays of pairwise distinct colors. 

We look at the color i of the edge $\{(o, c), (o, d)\}$ and consider two cases. If this edge is oriented as $(o, c)(o, d)$, we replace the ray of color i in $S \times \{c\}$ with the ray $(o, c)(o, d)R$, where R is the ray of color i in $S \times \{d\}$. If $\{(o, c), (o, d)\}$ is oriented as $(o, d)(o, c)$, we replace the ray of color i in $S \times \{d\}$ with the ray $(o, d)(o, c)R'$, where R' is the ray of color i in $S \times \{c\}$. 

After that, we get the desired (n + 1)-star in H. 

To see that $\overrightarrow{\mathrm{diam}}(G) = n$ we put r = 1, and $r\text{-}\mathrm{diam}_E(G) \ge n/r$ because at least one ray of S is of length ≥ n/r. 


What happens if we use the coloring of vertices instead of the coloring of edges? This case is easier, and after reading the above proof, must be clear (just use intervals in place of stars). 

Theorem 2: For any natural numbers n, r, there exists a Cartesian product G of n complete graphs such that, for any r-coloring of V(G) and orientation of E(G), there is a directed vertex-monochrome geodesic path of length n. 

Now we take an arbitrary graph Γ and a natural number r, and ask if there exists a graph G such that, for every r-coloring of E(G), there is a monochrome isometric copy of Γ. The answer is also positive and can be extracted from Theorem 1.3 in [4]. By construction, this G depends essentially on Γ and could be very complicated. 

But everything is simple with the choice of G in the following case. 

Theorem 3: Assume that a graph Γ is isometrically embedded into $K_n^m$, and let r be a natural number. Then there exists a natural number N such that, for $G = K_N^{mr-r+1}$ and any r-coloring of V(G) and r-coloring of E(G), there is an isometrically embedded, vertex-monochrome and edge-monochrome copy of Γ in G. 

For some characterization of graphs isometrically embeddable into $K_n^m$ see [5]. Evidently, $I_m$ is isometrically embeddable into $K_n^m$. 

We outline the proof of Theorem 3 in three steps: Theorems 4, 5, 6. 

If $S_1, \ldots, S_n$ are subgraphs of $G_1, \ldots, G_n$ respectively, we say that $S_1 \times \cdots \times S_n$ is a box in $G = G_1 \times \cdots \times G_n$. We say that a graph $H_1 \times \cdots \times H_k$ is box embeddable into G if, after some rearrangement of {1, ..., n}, there is a box $S_1 \times \cdots \times S_k \times \{u_{k+1}\} \times \cdots \times \{u_n\}$ in G such that each $S_i \cong H_i$, $i \in \{1, \ldots, k\}$, where ≅ is the sign of graph isomorphism. 

Theorem 4: For any natural numbers n, m, r, there exists a natural number N such that, for every r-coloring of $V(K_N^m)$, there is a monochrome box copy of $K_n^m$. 

We say that an edge-coloring χ of $G_1 \times \cdots \times G_n$ is a face coloring if the restriction of χ to each $E_i(G_1 \times \cdots \times G_n)$ is monochrome. 
Theorem 5: For any natural numbers n, r, there exists a natural number N such that, for $G = K_N^k$ and any r-coloring χ of E(G), the restriction of χ to some box $G_1 \times \cdots \times G_k$, $G_i \cong K_n$, is a face coloring. 

Theorem 6: For any natural numbers n, m, r, there exists a natural number N such that, for every r-coloring of $E(K_N^{mr-r+1})$, there is a monochrome box copy of $K_n^m$. 

We note that Theorem 4 is an elementary set-theoretical statement (see Lemma 1 from [3]), and Theorem 6 is a corollary of Theorem 5. Thus only Theorem 5 requires some inductive efforts (using Theorem 4 and the Ramsey theorem). After this, Theorem 3 follows directly from Theorem 4 and Theorem 6. 

Applying Theorem 1 and Theorem 4, we see that, for any natural numbers n, r, there is a Cartesian product G of complete graphs such that for any r-coloring of V(G), r-coloring of E(G) and orientation of E(G), there exists a vertex-monochrome, edge-monochrome directed geodesic path of length n. 

We conclude with a short selection of open problems arising in connection with directed and chromatic diameters of graphs. 

Problem 1: Detect or evaluate $\overrightarrow{\mathrm{diam}}(K_n^m)$. 


A graph G is called a comparability graph (or transitively orientable) if E(G) admits an orientation $\overrightarrow{E}(G)$ such that 

$uv, vw \in \overrightarrow{E}(G)$ 

implies 

$uw \in \overrightarrow{E}(G)$. 

It is known (see [1, 6.1.1]) that G is a comparability graph if and only if every cycle of odd length in G has a triangular chord. Using this characterization, it is easy to conclude that G is a comparability graph if and only if $\overrightarrow{\mathrm{diam}}(G) = 1$. 
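
For very small graphs the directed diameter can be found by exhausting all orientations. The sketch below is a brute-force illustration under our own conventions (vertices 0, ..., n-1, edges as pairs, hypothetical function names); it is exponential in the number of edges and only meant for toy cases such as the cycles $C_4$ (a comparability graph) and $C_5$ (an odd cycle without chords).

```python
from collections import deque
from itertools import product

def all_distances(n, edges):
    """dist[s][v] = path-metric distance from s to v in the undirected graph."""
    adj = {v: set() for v in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    dist = {}
    for s in range(n):
        d = {s: 0}
        queue = deque([s])
        while queue:
            x = queue.popleft()
            for y in adj[x]:
                if y not in d:
                    d[y] = d[x] + 1
                    queue.append(y)
        dist[s] = d
    return dist

def longest_directed_geodesic(n, arcs, dist):
    """Length of the longest directed path that is also a geodesic."""
    out = {v: [] for v in range(n)}
    for u, v in arcs:
        out[u].append(v)
    best = 0
    for s in range(n):
        stack = [(s, 0)]   # (current vertex, its distance from s along the path)
        while stack:
            x, k = stack.pop()
            best = max(best, k)
            for y in out[x]:
                if dist[s].get(y) == k + 1:   # the extended path is still a geodesic
                    stack.append((y, k + 1))
    return best

def directed_diameter(n, edges):
    """Minimum over all orientations of the longest directed geodesic path."""
    dist = all_distances(n, edges)
    best = None
    for flips in product([False, True], repeat=len(edges)):
        arcs = [(v, u) if flip else (u, v) for (u, v), flip in zip(edges, flips)]
        p = longest_directed_geodesic(n, arcs, dist)
        best = p if best is None else min(best, p)
    return best

def cycle(n):
    return [(i, (i + 1) % n) for i in range(n)]

print(directed_diameter(4, cycle(4)))   # C_4
print(directed_diameter(5, cycle(5)))   # C_5
```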

Problem 2: Characterize all G of $\overrightarrow{\mathrm{diam}}(G) = 2$. 

Problem 3: Characterize all G of $2\text{-}\mathrm{diam}_E(G) = 1$ and all G of $2\text{-}\mathrm{diam}_V(G) = 1$. 
Solving Problems by "Cheating" 

Operational Calculi, Function Theory, and Differential Equations 
Emeritus Professor, University of New Mexico 


At first the operators transform the functions, then the 
functions transform the operators. 

William Clifford wrote, in Common Sense of the Exact Sciences, that sometimes in algebra we ask a question that has no answer: “If we write down the symbols for the answer to the question in cases where there is no answer and then speak of them as if they meant something, we shall talk nonsense. But the nonsense is not to be thrown away as useless rubbish. We have learned by very long and varied experience that nothing is more valuable than the nonsense we get in this way...We turn the nonsense into sense by giving a new meaning to the words or symbols which shall enable the question to have an answer, that previously had no answer.” (Quoted by Alexander Macfarlane, “The fundamental principles of algebra,” Aug 21, 1898, AAAS). 

Say you're in some math course, and you are handed these problems: 

(1) Solve $y'' + y' - 2y = f$. 

(2) Solve $du/dt = Au$, where A is a 5-by-5 matrix with complex entries. 

(3a) Solve $u_t = u_x$, given $u(0, x)$. 

(3b) Solve $u_{tt} = c^2u_{xx}$, given $u(0, x)$ and $u_t(0, x)$. 

(3c) Solve $u_t(t, x) = u_{xx}(t, x)$ for t > 0, given $u(0, x)$. 

To solve all of these problems, I will show you a secret two-step method. First, treat mathematical symbols like A or D as if they were numbers, and get “solutions” which are meaningless nonsense: 

(1) $y = f/(D^2 + D - 2)$. 

(2) $u(t) = e^{tA}u(0)$. 

(3a) $u(t, x) = e^{tD}u(0, x)$, where $D = d/dx$. 

(3b) $u(t, x) = e^{ctD_+}f(x) + e^{ctD_-}g(x)$, where $D_+ = d/dx$, $D_- = -d/dx$ and f and g are “arbitrary”. 

(3c) $u(t, x) = e^{tQ}u(0, x)$, where $Q = (d/dx)^2$. 

Then figure out the meaning of this meaningless nonsense. 

Example 1, sometimes taught in introductory ODE, is usually misattributed to the telephone engineer Oliver Heaviside, but it was written down before Heaviside by a forgotten algebraist, George Peacock. He was a member of the Analytical Society, along with Charles Babbage, and contemporary to William Rowan Hamilton and George Boole. Peacock stated the following “principle or law of the permanence of equivalent forms” in his 1830 Treatise on Algebra: “Whatever form is Algebraically equivalent to another, when expressed in general symbols, must be true, whatever those symbols denote.” This vague principle can be proved as a theorem in several ways. As we shall see, it is a principle that will serve us well. 

Example 2 is in a beautiful, seldom taught chapter of linear algebra: “Functions of Matrices”. 

Examples 3a, 3b, and 3c are “the initial-value problem” for the “transport equation,” the “wave equation,” and the “heat equation”. Such problems are sometimes solved in graduate school as applications of “Banach algebra” or “symmetric linear operator on Hilbert space”. Those “modern methods” actually are pumped up versions of the symbolic method, which we are first going to see in matrices and simple ODEs. 

Perhaps a good way to illustrate the transmutation idea (and in fact the whole symbolic method) is with a commutative arrow diagram from universal algebra 

A ---► B 


C ---► D 

You start at A with a PDE in a linear vector space. Go across to B with symbolic substitutions for the formulas. Find a way to drop down to D with a meaningful representation by using the symbols in some familiar space such as geometry, algebra, trig, complex plane, matrices, some infinite topological space, semi-groups, Lie algebras. This transformation might have a very complex commutative diagram of its own. Go across to C by finding a solution in the representation domain. Go back from C across to D checking convergence and validity of the solution in the domain. Finally map between the original problem A and the solution form C. 

In addition to explaining the meanings of the three symbolic answers given above, we will ponder on the meaning of “meaning”. 

In conclusion, I will offer a couple of contributions of my own. First there is “the method of transmutations”, where I transform the solution of one operator differential equation into the solution of another one. Then there's a trick where I produce a stable, convergent difference scheme of any desired rate of convergence, to any well-posed linear initial-value problem. 

Boole's Symbolic Method for 
Simple ODEs 

We start with a venerated old operator calculus, commonly misattributed to the telephone engineer Oliver Heaviside. Basic operations you apply to functions in a calculus course are 

• D, differentiating: f(t), a distance function, goes to f'(t), its velocity. 

• anti-differentiating or integrating: f(t) goes to a function F(t) satisfying F'(t) = f(t). 

• $E_h$, shifting: $E_hf(x) = f(x + h)$, where h is a real number, a “parameter” that can be chosen as convenient. Then iterating or repeating $E_h$ n times gives $f(x + nh)$. Inverting is just shifting back to the left. 

The two basic operators on functions, D and $E_h$, are related by a beautiful formula. Taylor's theorem in calculus can be rewritten in operator notation: 

$f(x + h) = \sum_n h^nf^{(n)}(x)/n!$ 

becomes 

$E_hf(x) = \sum_n (hD)^nf(x)/n!,$ 

which we can simplify, dropping the general victim “f”, to 

$E_h = \sum_n (hD)^n/n!$ 

Any student of calculus should recognize the right-hand side as the power series of the exponential function with base e and “variable” hD. In brief, $E_h = e^{hD}$, and furthermore $hD = \log(E_h)$. The formula $hD = \log(E_h)$ opens the way to an approximation of D by powers of $E_h$, the beginning terms of a series expansion of the log. That is to say, a higher-order approximation of differentiation by higher-order finite differences. 

These two beautiful formulas connect with and enlighten an important practical problem: discretizing differential operators, that is, getting good workable approximations using finite differences. 
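
One way to see the formula $hD = \log(E_h)$ at work is sketched below. It is only an illustration under our own assumptions: we write $E_h = 1 + \Delta$ with Δ the forward difference, expand $\log(1 + \Delta) = \Delta - \Delta^2/2 + \Delta^3/3 - \cdots$, and truncate after a few terms. The test function (the exponential), the step h = 0.1 and the function names are arbitrary choices of ours.

```python
import math
from math import comb

def forward_difference(f, x, h, k):
    """k-th forward difference (E_h - 1)^k f(x), expanded by the binomial theorem."""
    return sum((-1) ** (k - j) * comb(k, j) * f(x + j * h) for j in range(k + 1))

def derivative_from_log_series(f, x, h, terms):
    """Approximate D f(x) by truncating hD = Delta - Delta^2/2 + Delta^3/3 - ..."""
    total = sum((-1) ** (k + 1) * forward_difference(f, x, h, k) / k
                for k in range(1, terms + 1))
    return total / h

# The derivative of exp at 0 is exactly 1, so the error is easy to read off.
errors = [abs(derivative_from_log_series(math.exp, 0.0, 0.1, terms) - 1.0)
          for terms in (1, 2, 3, 4)]
print(errors)   # each extra term of the log series shrinks the error
```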

If we move up to functions of 2 or 3 variables, shifting leads straight back into matrix theory. And if we divide by h after shifting by h, we are doing finite differences and almost back to $D = d/dt$. George Boole, most remembered for his logic, wrote a once popular book on the calculus of finite differences. 

With that general introduction out of the way, we can turn to Problem 1 above. We will learn an effective elementary technique to solve a linear ordinary differential equation with constant coefficients, of arbitrarily high order. We are given $y'' + y' - 2y = f$. The key step is to abandon Newton's “dot” notation for derivatives, and rewrite the equation in Leibnitz's “D” notation: $D^2y + Dy - 2y = f(t)$, and then to “factor out” the differential operators, thus: $(D^2 + D - 2)y(t) = f(t)$. Then we must dare to write the “solution” in this childish, meaningless way, as $y = f(t)/(D^2 + D - 2)$, or equivalently $y = (D^2 + D - 2)^{-1}f(t)$. Then all we have to do is to figure out the “meaning” of $(D^2 + D - 2)^{-1}$. But if we think of factoring $D^2 + D - 2$ as $(D + 2)(D - 1)$, then 

$1/(D^2 + D - 2) = [1/(D + 2)][1/(D - 1)],$ 

or 

$(D^2 + D - 2)^{-1} = (D + 2)^{-1}(D - 1)^{-1}.$ 

We will be done if we can just invert $(D + 2)$ and $(D - 1)$, and apply those inverses successively. 

But Peacock and Boole and Heaviside showed how to make the solution even simpler, for 

$1/(D^2 + D - 2) = 1/[(D + 2)(D - 1)] = \tfrac{1}{3}[1/(D - 1) - 1/(D + 2)].$ 

That is 

$(D^2 + D - 2)^{-1} = \tfrac{1}{3}[(D - 1)^{-1} - (D + 2)^{-1}].$ 

We have written the reciprocal of a product as a sum of reciprocals of individual factors, by the method of “partial fractions” which we learned as a useful trick in integral calculus. Now, instead of having to successively invert the factors of $D^2 + D - 2$, we invert them simultaneously, and then add the results. (The very same little trick reappears below, in functional calculus, under the impressive name of First and Second Resolvent Identities.) 

So it all comes down to inverting $D - p$, for an
arbitrary number $p$. To invert $D - p$ (to find
$(D - p)^{-1}$) just means, solve the equation
$y' - py = f$, for “arbitrary” $f$. But that's easy! You
saw it already in calculus, then again in introductory
ODE. The answer is

$$y(x) = e^{px}\left(\int e^{-px} f(x)\,dx + C\right),$$

where the “arbitrary constant $C$” remains to be
determined by some initial condition. So we rewrite
$[1/(D - p)]f$ as

$$e^{px}\left(\int e^{-px} f(x)\,dx + C\right),$$

and define the inverse, $(D - p)^{-1}$, as

$$(D - p)^{-1} f(t) = Ce^{pt} + e^{pt}\int e^{-pt} f(t)\,dt.$$

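In our example, with $p = 1$ for the factor $D - 1$ and
$p = -2$ for the factor $D + 2$,

$$y(t) = \frac{1}{3}\left(C_1 e^{t} + e^{t}\int e^{-t} f(t)\,dt - C_2 e^{-2t} - e^{-2t}\int e^{2t} f(t)\,dt\right).$$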

In summary 

• To invert the product of two operators, multiply 
the inverses of the factors. 

• To invert a composite differential operator such as
$(D - p)(D - q)$, invert the factors consecutively,
to get the inverse of their composite.
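
The recipe above can be tried out numerically. The following is only a sketch under stated assumptions: the function names are invented for illustration, the arbitrary constants are set to zero, and the integral in $(D - p)^{-1}$ is taken from 0 to $t$ and approximated by Simpson's rule. It applies the partial-fractions form of $(D^2 + D - 2)^{-1}$ to $f = 1$, where the answer can also be found by hand.

```python
import math

def inverse_D_minus_p(p, f, t, n=2000):
    """Apply (D - p)^{-1} to f at time t, with C = 0:
    e^{pt} * integral_0^t e^{-ps} f(s) ds, by Simpson's rule (n must be even)."""
    if t == 0.0:
        return 0.0
    h = t / n

    def g(s):
        # e^{pt} e^{-ps} f(s), combined into one exponential
        return math.exp(p * (t - s)) * f(s)

    total = g(0.0) + g(t)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(k * h)
    return total * h / 3

def particular_solution(f, t):
    """(D^2 + D - 2)^{-1} f = (1/3) [ (D - 1)^{-1} f - (D + 2)^{-1} f ]."""
    return (inverse_D_minus_p(1.0, f, t) - inverse_D_minus_p(-2.0, f, t)) / 3

# For f = 1 the two integrals can be done by hand:
# y(t) = e^t / 3 + e^{-2t} / 6 - 1/2, which satisfies y'' + y' - 2y = 1.
y_numeric = particular_solution(lambda s: 1.0, 1.0)
y_exact = math.e / 3 + math.exp(-2.0) / 6 - 0.5
```

Choosing the lower limit 0 and $C = 0$ picks out the particular solution with $y(0) = y'(0) = 0$; other choices add multiples of $e^{t}$ and $e^{-2t}$.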

To solve Problem 1, we just wrote $y(t)$ as
$(D + 2)^{-1}(D - 1)^{-1} f$. This is a successive
operation with one inverse operator following another.
In the same way, any polynomial $p(w)$ with real or
complex coefficients can be factored into a product of
first-degree factors $(w - w_j)$, and the reciprocal of
the polynomial $p(w)$ is the product of the reciprocals,
$1/(w - w_j)$. The general problem of solving linear
constant coefficient ODEs is thereby reduced to finding
the meaning of $[1/(D - w_j)]f$. That is the same as
solving the first order ODE $y' - w_j y = f$, and we
do that by a simple calculation using integration by
parts. The answer from your calculus course is

$$y(t) = e^{pt}\left(\int e^{-pt} f(t)\,dt + C\right),$$

(where $p = w_j$). The inverse of the differential
polynomial $p(D)$ is the product of the inverses of its
factors $D - w_j$. If the roots are distinct, you can expand
the composite inverse operator by means of partial
fractions, invert each piece, and then just add them up.

But what about a differential polynomial with a complex
root? Let's say it's $i$, to make it simple. Looking back at
what we just did, the problem now comes down to a first
order DE with a complex coefficient, $y' - iy = f$.
In the solution formula we copied from calculus, the
complex number $i$ appears in an exponent. We are back
to the exponential function we just struggled with, but
with a complex exponent.

You probably already know that $e^{it} = \cos t + i\sin t$.
With this interpretation, the inversion we accomplished
with real roots also makes sense for complex roots. If a
complex root has nonzero real part $r$, the exponential
function in the solution is multiplied by a factor $e^{rt}$.

A careful logical critic of some steps in this derivation 
could demand, “What do you mean by an equality
between two meaningless expressions?” Peacock's
principle of permanence of equivalent forms is a heuristic
principle, not a precise theorem. Our answer might be,
“We expect in the end to interpret these expressions
sensibly, and after such an interpretation, the two sides
will really be equal.”

For real $x$, the elementary identity
$1/p(x) = \sum_j c_j/(x - w_j)$ simply “means” that
$p(x)\sum_j c_j/(x - w_j) = 1$. So the “meaningless”
identity about a differential polynomial $p(D)$,
$1/p(D) = \sum_j c_j/(D - w_j)$, can be said to simply
“mean” $p(D)\sum_j c_j(D - w_j)^{-1}f = f$. We don't
bother to specify appropriate conditions on $f$, because
we can simply “plug this expression into the differential
equation” to verify that it really is a solution. (We do not
claim to have found ALL solutions.)

Now we move on to Problem 2. 

Functions of Matrices 

In Problem 2, we are looking at the letter $e$ with the
letters $tA$ attached at its upper right corner. That would
make sense if the letter $A$ stood for a number. But what
entitles us to stick a matrix up there?

We first encounter this exponential notation in
elementary school, with a natural number as exponent. It's
just the number of times you multiply the base times
itself. Then we learn to accept fractional and negative
exponents, by requiring the exponential function to
satisfy the formula $e^a e^b = e^{a+b}$. Finally, we can even
understand an exponent which is a complex number,
because the power series expansion of the exponential
equals the sum of the series expansion of the cosine plus
$i$ times the expansion of the sine.

Oh, that's it! We can just take the power series
$e^{ct} = \sum_n (ct)^n/n!$, which converges for all values of
the real or complex variable $t$, and replace the
numerical coefficient $c$ with a matrix coefficient $A$!

Well, why not? $(tA)^n$ is easy, we know how to
multiply $A$ times itself. Matrix multiplication is not the
same as numerical multiplication, but it obeys all the
same axioms, doesn't it? Oh no, the commutative law
is false for matrices. But wait! The powers of a single
fixed matrix do all commute with each other. The $n$th
power times the $m$th is the $(m + n)$th power, in
either order of multiplication. So the terms of the power
series still make sense if they are $(tA)^n/n!$ instead of
$(ct)^n/n!$.

But what about convergence? The power series of the 
numerical exponential converges, because the tail of the 
series is very small. Even if $ct$ is very big, $(ct)^n/n!$
will be less than the $n$th power of “epsilon”, for epsilon as
small as you like, as long as $n$ is bigger than some $M$
(which depends on epsilon). Consequently, the Taylor 
series of the numerical exponential converges because it 
is majorized by a convergent power series. 
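
To make the substitution concrete, here is a small sketch (not a production routine) that sums the series $\sum_n (tA)^n/n!$ for a matrix stored as a list of rows. The helper names, the cutoff of 40 terms, and the test matrix are all choices made for this illustration; the test matrix squares to minus the identity, so its exponential should come out as a rotation.

```python
import math

def mat_mul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(A, t=1.0, terms=40):
    """Partial sum of the series sum_n (tA)^n / n!."""
    n = len(A)
    tA = [[t * a for a in row] for row in A]
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # (tA)^0 / 0!
    total = [row[:] for row in term]
    for k in range(1, terms):
        # turn (tA)^(k-1) / (k-1)! into (tA)^k / k!
        term = [[x / k for x in row] for row in mat_mul(term, tA)]
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total

J = [[0.0, -1.0], [1.0, 0.0]]    # J^2 = -I, so e^{tJ} = cos(t) I + sin(t) J
R = mat_exp(J, t=0.5)
```

A fixed number of terms is adequate only when $tA$ is of modest size; for large entries the early terms grow before the factorial takes over, and rounding errors suffer.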

Does this work for the matrix Taylor series? Yes! We use
the beautiful Cayley-Hamilton Theorem: “Any square
matrix satisfies its own characteristic function.”
Meaning, the scalar polynomial $p(x) = \det(A - xI)$
becomes the zero matrix when the scalar variable $x$ is
replaced by the matrix $A$. First I will use
Cayley-Hamilton, and then I will prove it.

Elementary algebra tells us that if $p(w)$ is a polynomial
of degree $n$, and $m > n$, then $w^m$ can be divided
by $p(w)$, with a quotient $Q_m$ and a remainder $R_m$:
$w^m = Q_m(w)p(w) + R_m(w)$. The remainder
$R_m$ has degree less than $n$. By Peacock's principle of
the permanence of equivalent forms, we can replace $w$
by $A$, and write $A^m = Q_m(A)p(A) + R_m(A)$.
By Cayley-Hamilton, $p(A) = 0$, so $A^m = R_m(A)$,
which is a polynomial in $A$ of degree less than $n$, the
degree of $p$. As a consequence, any polynomial in the
matrix $A$ can be reduced, term by term, to a polynomial
of degree less than $n$.
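
For a 2-by-2 matrix the reduction can be carried out by hand, since the characteristic polynomial is $x^2 - (\operatorname{tr} A)x + \det A$ and Cayley-Hamilton gives $A^2 = (\operatorname{tr} A)A - (\det A)I$. The sketch below, with names invented for this illustration, reduces $A^m$ to the form $aA + bI$ and compares it with the power computed directly.

```python
def mat_mul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def reduce_power(A, m):
    """Return (a, b) with A^m = a*A + b*I for a 2-by-2 matrix A and m >= 1,
    using A^2 = tr(A)*A - det(A)*I."""
    tr = A[0][0] + A[1][1]
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    a, b = 1, 0                         # A^1 = 1*A + 0*I
    for _ in range(m - 1):
        # (a*A + b*I) * A = a*A^2 + b*A = (a*tr + b)*A - a*det*I
        a, b = a * tr + b, -a * det
    return a, b

A = [[1, 2], [3, 4]]
a, b = reduce_power(A, 5)
reduced = [[a * A[i][j] + (b if i == j else 0) for j in range(2)] for i in range(2)]

direct = A
for _ in range(4):                      # four more factors of A give A^5
    direct = mat_mul(direct, A)
```

With integer entries the comparison is exact, so `reduced == direct` holds without any rounding tolerance.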

To prove the Cayley-Hamilton theorem, remember that
any $n$th degree polynomial in a complex variable $z$ can
be written as a product of $n$ linear factors $(z - w_j)$.
Therefore, by Peacock's principle again, the
characteristic polynomial of $A$ can be factored into a
product of $n$ linear factors $(A - w_j I)$, $1 \le j \le n$. The
roots $w_j$ of $p$ are the “eigenvalues” of $A$. Each factor
$(A - w_j I)$ annihilates a corresponding $j$th
eigenvector. If the eigenvalues are distinct, the eigenvectors span
the space, and $p(A)$ annihilates every vector in the
space, so it must be the zero matrix. (If there is a double
root of $p$, there is a corresponding 2-space that is
annihilated, and so on.) The proof is complete.

When we write down the exponential function $e^{tA}$, we
expect and require it to satisfy two fundamental
identities: $\frac{d}{dt} e^{ctA} = cAe^{ctA}$, and $e^{A+B} = e^A e^B$.
The validity of these formulas for the matrix
exponential, written as power series, follows by a simple
argument from their validity term by term for the power
series of the scalar exponential. This is part of Peacock's
“principle of permanence of equivalent forms.” We will
spell it out in detail.

Theorem 1: If the function $e^{tA}$ is defined by the
power series for the scalar exponential function, then
$\frac{d}{dt} e^{tA} = Ae^{tA}$. If the two $n$-by-$n$ matrices $A$
and $B$ satisfy $AB = BA$, then $e^A e^B = e^{A+B}$.
Proof: Since the numerical exponentials $e^x$ and $e^y$
satisfy the two proposed identities, their power series
when multiplied term by term or differentiated term
by term satisfy those identities. That means that the
coefficients of the $n$th power on both sides of the “equals
sign” are the same, for all $n$. But then the two series
will be identical if instead of numerical variables $x$ and
$y$, any other symbols are substituted which satisfy the
same rules for addition and multiplication.
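
The role of the hypothesis $AB = BA$ is easy to see in a numerical experiment. In this sketch the helper names and the four test matrices are chosen only for illustration, and the exponential is a truncated power series. The first pair commutes and the second does not.

```python
def mat_mul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def mat_exp(A, terms=40):
    """Partial sum of sum_n A^n / n!."""
    n = len(A)
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    total = [row[:] for row in term]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, A)]
        total = mat_add(total, term)
    return total

def max_abs_diff(X, Y):
    return max(abs(x - y) for rx, ry in zip(X, Y) for x, y in zip(rx, ry))

A = [[1.0, 2.0], [0.0, 1.0]]
B = [[3.0, -1.0], [0.0, 3.0]]          # AB = BA
gap_commuting = max_abs_diff(mat_mul(mat_exp(A), mat_exp(B)), mat_exp(mat_add(A, B)))

C = [[0.0, 1.0], [0.0, 0.0]]
D = [[0.0, 0.0], [1.0, 0.0]]           # CD != DC
gap_noncommuting = max_abs_diff(mat_mul(mat_exp(C), mat_exp(D)), mat_exp(mat_add(C, D)))
```

For the commuting pair the gap is at the level of rounding error. For the second pair $e^C e^D$ has top-left entry 2, while $e^{C+D}$ has top-left entry $\cosh 1$, so the identity genuinely fails.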

The reasoning we have used regarding the exponential 
function works just as well for ANY square matrix, and 
ANY function represented by a power series (any
“analytic function”). The resulting matrix-valued function of
matrices will satisfy any identity satisfied by the original
analytic function of a real or complex variable. For
instance, all of the multitude of trigonometric identities,
which originate in reasoning about triangles and circles,
continue to be valid for the complex functions $\sin z$
and $\cos z$, and continue to be valid as matrix-valued
functions of arbitrary matrices, as long as all the
matrices in any formula are mutually commutative (as will be
true if they are all functions of one matrix $A$).

The identities are valid for all $z$ and $A$, even if the
series is convergent only in a bounded region of the $z$
plane. This is an important example of Peacock's
principle of permanence of equivalent form.

Complex numbers are included, because complex
numbers $z = a + bi$ can be represented by special
two-by-two matrices $aI + bJ$, where $I$ is the 2-by-2
identity, and $J$ is

This is wonderful magic, and it's too good to be true, at
least too good to be true always. The symbolic method
can lead to disaster. A bit of caution is required!
Multiple-valued functions are dangerous. The very simplest
one, the square root, applied to the 2-by-2 identity
matrix, takes on not two but four values. (Just put
plus-or-minus signs in front of the two 1's in the identity
matrix, to display its four square roots.)

Once we feel comfortable with power series of matrices, 
we naturally ask, what other functions besides the
exponential can we compute by power series? The inverse
function is a tempting example. The very simplest
problem in linear algebra, solving a system of linear equations,
is nothing more than finding the inverse of a given square
matrix. And the very first power series that you met
in school was $1 + x + x^2 + \cdots = 1/(1 - x)$.
$1/(1 - x)$ is the multiplicative inverse of $1 - x$, so
if we replace $x$ by $A$, we must get the inverse of the
matrix $I - A$. (“$I$” of course means the $n$-by-$n$
identity matrix.) To check this out, take a very simple
example. Choose $A$ as a scalar diagonal matrix, with
all entries on the main diagonal equal to 10, and all
off-diagonal entries zero. Then $I - A$ is the scalar
matrix $-9$, and its inverse is the scalar matrix $-1/9$.
So then, is $-1/9$ the sum of the powers of 10? Does
$1 + 10 + 100 + \cdots = -1/9$? Nonsense! The
sum of the powers of 10 diverges. It does not “exist.” The
supposition that it “exists” is false. It's no surprise that a
false conclusion comes from a false hypothesis.

Worse yet, choose as $A$ the identity matrix $I$.
Then the sum of powers of $A$ would be just
$I + I + I + \cdots$, the sum of copies of $I$. The $n$th
partial sum would be $nI$. But $I - A$ is just the zero
matrix, and $1/(I - A)$ means “Divide by zero”, so of
course the series has to diverge.

On the other hand, if the entries in the matrix $A$ are
small, then the sum of the powers of $A$ ought to
converge. How small should they be? In the scalar case, you
probably know that the sum of the powers $x^n$
converges if and only if $x$ is less than 1 in absolute value. What
is the analogous statement for the sum of the powers of
a matrix $A$? That is your first homework assignment.

Exercise: For which square matrices of real or complex
numbers does the power series $I + A + A^2 + \cdots$
converge?

You will find it hard to answer this without first stating
the meaning of “convergence” for a sequence of
matrices, and what you would mean by saying a matrix is
“smaller” than the identity matrix. Have fun!
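
Before attacking the exercise, it may help to experiment. The sketch below (helper names and both test matrices are chosen only for illustration) computes partial sums of $I + A + A^2 + \cdots$ for one matrix with small entries and for the scalar matrix with 10 on the diagonal discussed above. It does not answer the exercise; it only shows the two kinds of behaviour.

```python
def mat_mul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def partial_sum(A, terms):
    """I + A + A^2 + ... + A^(terms - 1)."""
    n = len(A)
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # A^0
    total = [[0.0] * n for _ in range(n)]
    for _ in range(terms):
        total = [[total[i][j] + power[i][j] for j in range(n)] for i in range(n)]
        power = mat_mul(power, A)
    return total

small = [[0.5, 0.25], [0.0, 0.5]]
S = partial_sum(small, 200)
# I - small = [[0.5, -0.25], [0, 0.5]], whose inverse is [[2, 1], [0, 2]]

big = [[10.0, 0.0], [0.0, 10.0]]
T = partial_sum(big, 5)       # diagonal entries 1 + 10 + 100 + 1000 + 10000
```

Here `S` agrees with the inverse of $I - A$ to rounding error, while the entries of `T` grow without bound as more terms are taken.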

It is tempting to try to carry over to the differential
operator $D$ or $p(D)$ the same trick we just saw for
inverting (some, not all) matrices. Is $D$ “small” so that
the sum of powers of $D$ will converge to $(I - D)^{-1}$? No,
not at all. Saying $D$ is small, in the appropriate sense
here, would mean that the powers of $D$ get small as
you take higher powers, which can only mean that for
some sufficiently inclusive class of functions $f$, the sum
of $f$ and all its derivatives is small. That works if $f$ is
a polynomial, or even an entire analytic function of
exponential type. (If you don't know what that means,
don't worry about it, just keep on reading.) But we don't
want to restrict ourselves only to such “nice” functions.
The problem itself is meaningful for functions $f$ that
are merely differentiable once, or even with sharp
corners here and there. We shouldn't need derivatives of
order higher than the first. So power series in $D$ is
not the right way to make sense of $1/(1 - D)$ or
$1/(p - D)$. Heaviside's method is the right way to go.

Another Functional Calculus 
for Matrices 

Can we find another interesting class of matrices that 
all commute with each other? Yes! The diagonal
matrices, with zeroes in all the off-diagonal positions. A
function $f(A)$ of a diagonal matrix $A$ is easy to write
down - just apply $f$ separately to each number on the
main diagonal of $A$. Of course, that function must be
well-defined for all the entries on the diagonal of $A$.
For example, we can't define $f(A)$ as $A^{-1}$ if $A$ has
a zero on the main diagonal, which is the same thing as
saying $A$ has zero as one of its eigenvalues. A matrix
with a zero eigenvalue is singular, non-invertible. On
the other hand, if $A$ is non-singular, and has an inverse
$A^{-1}$, we can write powers of $A^{-1}$, which give us the
“Laurent series” of $A$ itself. Singularities of $f(z)$ at
points in the complex plane other than 0 do not
interfere.

This method might seem to be of limited interest, since 
diagonal matrices are so special. But starting with the 
diagonal, we can go to a much much bigger class of 
matrices by the relation of similarity (a term used in a 
different way in plane geometry). We say “$A$ is similar
to $B$” if there is a non-singular change of variables that
transforms $A$ to $B$, meaning that there is an
invertible matrix $P$ such that $A = P^{-1}BP$. In that case,
any function $f(A)$ (a polynomial function or the limit
of a convergent sequence of polynomial functions) will
be similar to $f(B)$. To see this, first consider just the
squares of $A$ and $B$:

$$A^2 = (P^{-1}BP)(P^{-1}BP) = (P^{-1}B)(PP^{-1})(BP) = (P^{-1}B)I(BP) = P^{-1}B^2P,$$

which means exactly that $A^2$ is similar to $B^2$. The
same calculation repeated $n - 1$ times will show that
$A^n$ is similar to $B^n$, for all $n$, by the same
transformation $P$ and $P^{-1}$.

Also, if $A$ is similar to $C$, and $B$ is similar to $D$ by
the same transformation $P$ and $P^{-1}$, then $A + B$ is
similar to $C + D$. Therefore by addition, $p(A)$, any
polynomial function of $A$, is similar to $p(B)$. Going
over to limits by continuity, we end up being able to
apply irregular functions such as step functions, piecewise
linear functions, etc., to any $B$ which is similar to some
diagonal matrix $A$.

Symmetric and skew-symmetric matrices turn out to 
be similar to diagonal ones. The entries on the diagonal 
matrix A are just the eigenvalues or characteristic val- 
ues of B , the solutions of the characteristic equation. 

So we have two different “functional calculi” for
matrices. For any analytic function $f$, and any square matrix
$A$, the Taylor series of $f$ gives us an interpretation of
$f(A)$. On the other hand, if $A$ is similar to a diagonal
matrix $D$ ($A$ is symmetric, skew-symmetric, “normal”, or
has $n$ distinct eigenvalues) then all we require from $f$ is
to be defined on the diagonal elements of $D$, which are
simply the eigenvalues or the spectrum of $A$. No
conditions of regularity or boundedness on $f$ are needed!

In both of these two functional calculi, any identity
satisfied by $p(x)$ is also satisfied by $p(A)$. (Peacock's
principle.) It is important to realize that with this
“spectral” interpretation of the functional calculus, the
seemingly obscure meaning of $f(A)$ for an arbitrary
matrix $A$ and any arbitrary function $f$ becomes easy to
understand. If $A$ is similar to a diagonal matrix, then
by choosing the eigenvectors as coordinates, we see that
$f(A)$ operates independently in each coordinate
direction, where it is simply multiplication by $f(w_j)$,
the function $f$ evaluated at the $j$th eigenvalue.
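
For a symmetric matrix this “spectral” recipe is short enough to write out. The sketch below assumes the eigenvalues and orthonormal eigenvectors are already known (here they are entered by hand for the matrix with rows (2, 1) and (1, 2)); the function name and the choice of test functions are for illustration only. Note that the step function used at the end is not analytic, yet the recipe applies to it without difficulty.

```python
import math

def apply_function(f, eigenpairs):
    """f(A) = sum_j f(w_j) v_j v_j^T, for eigenvalues w_j with
    orthonormal eigenvectors v_j."""
    n = len(eigenpairs[0][1])
    return [[sum(f(w) * v[i] * v[j] for w, v in eigenpairs) for j in range(n)]
            for i in range(n)]

# The symmetric matrix [[2, 1], [1, 2]] has eigenvalues 1 and 3,
# with eigenvectors (1, -1)/sqrt(2) and (1, 1)/sqrt(2).
s = 1 / math.sqrt(2)
eigenpairs = [(1.0, [s, -s]), (3.0, [s, s])]

A = apply_function(lambda w: w, eigenpairs)           # f(w) = w recovers A
root = apply_function(math.sqrt, eigenpairs)          # a square root of A
step = apply_function(lambda w: 1.0 if w > 2 else 0.0, eigenpairs)  # step function of A
```

Squaring `root` gives back `A`, and `step` is the projection onto the eigenvector belonging to the eigenvalue 3.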

The Riesz-Dunford and
Laplace-Phillips Functional Calculi

If you have had complex variables, you should
remember the Cauchy integral formula. This is the surprising
fact that for any analytic function $f(z)$, if you integrate
the quotient $f(z)/(2\pi i(z - w))$ around a simple
closed curve containing the complex number $w$ in the
interior, you get the value of $f$ at that point! (If this
is news to you, take it on faith for the purpose of this
article. The definition of complex integration works in a
very natural way, with the usual rules carried over from
calculus.)

Now, in this integration formula, nothing prevents us
from replacing the complex number $w$ by a matrix $A$.
Then instead of dividing $f(z)$ by $z - w$, we “divide”
it by $z - A$. What does that “mean”? Just as in Boole's
operator method in Section 1 above, it means
“Multiply $f(z)$ by an inverse operator, $(z - A)^{-1}$”. The
standard definition of the integral survives under this
substitution. We now have a meaningful
matrix-valued expression, which is the obvious candidate to serve
as the definition of $f(A)$. This formula is called the
“Riesz-Dunford functional calculus.”

Voila! We have extended the analytic function $f$ from
the domain of the complex numbers to the domain of
matrices! The set of complex numbers $z$ where $z - A$
is not invertible is called the spectrum of $A$, or the set
of eigenvalues of $A$. The complement of the spectrum
in the complex plane is called “the resolvent set of $A$”
and $(z - A)^{-1}$ is called “the resolvent matrix”. It is
a matrix-valued analytic function of $z$, and we can
integrate it with respect to $z$. In the elementary case, the
operation $A$ is just multiplication by a fixed complex
number $w$, and we are looking at Cauchy's integral
formula. We can multiply the resolvent $(z - w)^{-1}$ by
any function that is analytic or holomorphic in the
interior of the simple closed curve, and by integrating get
$f(w)$. The same integration with respect to complex $z$,
and with a matrix $A$ instead of the complex number $w$,
gives us a definition of $f(A)$ for any analytic function
$f$ and any square matrix $A$. One little detail should
worry us. Is the value of this integral independent of the
path of integration? In fact, we have to be careful about
where we go with respect to the “spectrum,” the
eigenvalues. In the functional calculus based on
diagonalization, we needed to be sure the function $f$ being applied
to a matrix $A$ is defined on the spectrum or eigenvalues
of $A$. In our present construction of $f(A)$, we have to
be sure all eigenvalues of $A$ are in the interior of our
path of integration. With that precaution, Peacock's
principle of the permanence of equivalent forms is still
good. An identity satisfied by the complex-valued
analytic function $f(z)$ is also satisfied by the
corresponding matrix-valued function of the matrix $A$.
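
The contour integral can be approximated quite directly. In the sketch below, everything is an assumption made for illustration: the path is the circle $|z| = 5$, the integral is replaced by an equally spaced sum over 64 points on that circle, and the 2-by-2 test matrix has eigenvalues $-1$ and $-2$, safely inside the path. The function names are invented here.

```python
import cmath
import math

def resolvent_2x2(z, A):
    """(zI - A)^{-1} for a 2-by-2 matrix A, by the adjugate formula."""
    a, b = z - A[0][0], -A[0][1]
    c, d = -A[1][0], z - A[1][1]
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def riesz_dunford(f, A, radius, points=64):
    """Approximate (1 / (2 pi i)) * integral of f(z) (z - A)^{-1} dz over the
    circle |z| = radius, using equally spaced points in the angle."""
    total = [[0j, 0j], [0j, 0j]]
    for k in range(points):
        z = radius * cmath.exp(2j * math.pi * k / points)
        R = resolvent_2x2(z, A)
        # dz / (2 pi i) = z dtheta / (2 pi), and dtheta = 2 pi / points
        weight = f(z) * z / points
        for i in range(2):
            for j in range(2):
                total[i][j] += weight * R[i][j]
    return total

A = [[0.0, 1.0], [-2.0, -3.0]]     # eigenvalues -1 and -2, inside |z| = 5
expA = riesz_dunford(cmath.exp, A, radius=5.0)
```

Because the eigenvalues are $-1$ and $-2$, the exact answer is $e^{A} = e^{-1}(A + 2I) - e^{-2}(A + I)$, and the sum reproduces it to many decimal places. Shrinking the radius below 2 would leave an eigenvalue outside the path and spoil the result.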

Exercise: For $e^A$ defined by the Riesz-Dunford formula,
and assuming $AB = BA$, prove $e^{A+B} = e^A e^B$
and $\frac{d}{dt} e^{tA} = Ae^{tA}$.

Since the exponential function is an entire analytic
function (has no singularities in the finite complex
plane), in this example the path of integration can be
taken with arbitrarily large radius.

There are two useful operator identities in the
Riesz-Dunford calculus, which are stated in terms of a
function $R(z, A) = (z - A)^{-1}$:

• First Resolvent Identity:

$$R(z, A) - R(w, A) = (w - z)\,R(z, A)\,R(w, A).$$

• Second Resolvent Identity:

$$R(z, A) - R(z, B) = R(z, A)\,(A - B)\,R(z, B).$$

You might suspect that if the operator is D = d/dx, 
this is just the same as Heaviside’s partial fractions for- 
mula. 

Exercise: Verify this suspicion. 
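
Before proving anything, the two identities can be checked on numbers. The sketch below uses the convention $R(z, A) = (z - A)^{-1}$, for which the identities read $R(z, A) - R(w, A) = (w - z)R(z, A)R(w, A)$ and $R(z, A) - R(z, B) = R(z, A)(A - B)R(z, B)$. The matrices, the points $z$ and $w$, and the helper names are arbitrary choices for illustration.

```python
def mat_mul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_sub(X, Y):
    return [[X[i][j] - Y[i][j] for j in range(2)] for i in range(2)]

def resolvent(z, A):
    """R(z, A) = (zI - A)^{-1} for a 2-by-2 matrix A."""
    a, b = z - A[0][0], -A[0][1]
    c, d = -A[1][0], z - A[1][1]
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def max_abs(X):
    return max(abs(x) for row in X for x in row)

A = [[0.0, 1.0], [-2.0, -3.0]]     # eigenvalues -1, -2
B = [[1.0, 2.0], [0.0, -1.0]]      # eigenvalues 1, -1
z, w = 2 + 1j, -0.5 + 3j           # neither is an eigenvalue of A or B

# First identity: same matrix, two different points z and w
first_lhs = mat_sub(resolvent(z, A), resolvent(w, A))
first_rhs = [[(w - z) * x for x in row]
             for row in mat_mul(resolvent(z, A), resolvent(w, A))]
first_gap = max_abs(mat_sub(first_lhs, first_rhs))

# Second identity: same point z, two different matrices A and B
second_lhs = mat_sub(resolvent(z, A), resolvent(z, B))
second_rhs = mat_mul(mat_mul(resolvent(z, A), mat_sub(A, B)), resolvent(z, B))
second_gap = max_abs(mat_sub(second_lhs, second_rhs))
```

Both gaps come out at the level of rounding error. With the opposite sign convention $R(z, A) = (A - z)^{-1}$, the factor $A - B$ in the second identity becomes $B - A$.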

Still More Functional Calculus 
for Matrices 

In Cauchy's formula, a function of two variables, $z$ and
$w$, is integrated with respect to one variable to yield a
function of the other variable. In several familiar
formulas analogous to Cauchy's, a function of two
variables $f(t, w)$ is integrated with respect to $t$ to yield a
function $F(w)$. In the Cauchy formula, the function
we end up with is the numerator of the quotient being
integrated, but evaluated at an interior point. In
transforms like the Laplace and Fourier, the result of
integration is a new function of the parameter, which is called
the Laplace or Fourier transform respectively of the
original function that was integrated. Just as in the case
of the Cauchy formula and its matrix version, we can
replace the scalar parameter in the Laplace or Fourier
transform with a matrix $A$, and use this integral as a
definition of the transform function applied to $A$.

In the Cauchy and Riesz-Dunford calculi, we replace
the complex parameter $w$ in the kernel $(z - w)^{-1}$
by the operator $A$ to get the kernel $(z - A)^{-1}$. In the
Laplace transform, where the real-valued kernel $e^{wt}$
is multiplied by an input function $f(t)$ to yield the
transform $F(w)$, we can replace the real exponential
function $e^{wt}$ by the operator exponential $e^{tA}$. The
result is an operator-valued function of $A$, which can
and should be called $F(A)$.

In the Riesz-Dunford calculus, the function of a
complex variable $f(z)$, known on the boundary of a region,
is extended to a point $w$ inside that region merely by
use of the special function $1/(z - w)$, the resolvent.
The properties of $f$ near $w$ can be deduced from the
knowledge of $1/(z - w)$. So if the same formula is
used to define $f$ as applied to an operator $A$, it is
merely on the basis of the operator resolvent $(z - A)^{-1}$.
It is necessary to justify this by showing that the
resolvent of $A$ has all the information about $A$, just as the
fraction $1/(z - w)$ does so for the number $w$. This
is the importance of the two Resolvent Identities. They
show that the operator resolvent works just like the
complex valued resolvent.

Phillips evidently noticed that just as in the
Riesz-Dunford calculus the resolvent of $A$ serves as a building
block to define a whole large class of functions of $A$,
so the semi-group generated by $A$ can be used as a
building block to define a whole large class of
functions of $A$.

For example, the Laplace transform of the real-valued
exponential function $e^{at}$, which is the result of
integrating from 0 to infinity $e^{at}e^{-wt}$, is $1/(w - a)$.
So if we integrate from 0 to infinity the operator-valued
function $e^{tA}e^{-wt}$, we get the operator-valued
function $(w - A)^{-1}$. Surprise! We have recovered the
“resolvent function” of the Riesz-Dunford calculus! The
Riesz-Dunford functional calculus is the Laplace
transform of the Hille-Phillips functional calculus!
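
This claim is pleasant to test numerically. The sketch below makes several assumptions for the sake of illustration: a 2-by-2 matrix with eigenvalues $-1$ and $-2$, the value $w = 1$, an integral truncated at $T = 40$, and Simpson's rule with step 0.01. It also uses the semi-group law $e^{(t+h)A} = e^{tA}e^{hA}$ to step the matrix exponential forward instead of summing a new series at every time. All names are invented here.

```python
import math

def mat_mul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(A, t, terms=30):
    """Partial sum of sum_n (tA)^n / n!, adequate when tA is small."""
    n = len(A)
    tA = [[t * a for a in row] for row in A]
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    total = [row[:] for row in term]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, tA)]
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total

def laplace_of_semigroup(A, w, h=0.01, steps=4000):
    """Simpson approximation of the integral of e^{-wt} e^{tA} over [0, h*steps]."""
    n = len(A)
    one_step = mat_exp(A, h)                       # e^{hA}
    current = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # e^{0A}
    total = [[0.0] * n for _ in range(n)]
    for k in range(steps + 1):
        coeff = 1 if k in (0, steps) else (4 if k % 2 else 2)
        weight = coeff * math.exp(-w * k * h) * h / 3
        total = [[total[i][j] + weight * current[i][j] for j in range(n)]
                 for i in range(n)]
        current = mat_mul(current, one_step)       # semi-group law
    return total

A = [[0.0, 1.0], [-2.0, -3.0]]
L = laplace_of_semigroup(A, w=1.0)
# For w = 1, wI - A = [[1, -1], [2, 4]], with inverse (1/6) [[4, 1], [-2, 1]].
```

The integral converges here because $w = 1$ lies to the right of both eigenvalues, so $e^{-wt}e^{tA}$ decays as $t$ grows.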

This functional calculus is Chapter 15 of Functional
Analysis and Semi-groups by Hille and Phillips, where 
it is credited to Phillips. I knew Phillips when I was an 
instructor at Stanford University in 1963-64. He col- 
laborated with my adviser, Peter Lax, in their massive 
theory of scattering. A few years later I worked with 
Einar Hille, when he was at the University of New Mex- 
ico after retiring from Yale and before moving on to the 
University of California in La Jolla. 

Nelson Dunford was a professor at Yale University. With 
Jack Schwartz he co-authored their three-volume bible, 
Linear Operators. Schwartz was one of my professors at 
NYU in the 1950s. Functional Analysis by Frigyes Riesz
and his pupil Bela Sz.-Nagy was our textbook. Riesz was
one of the preeminent founders of functional analysis.
He spent most of his life at the University of Szeged in 
Hungary, because in the University of Budapest math 
department Leopold Fejer was already present, and 
“one” was the maximum number of Jews allowed there. 
In 1988 on a visit to Szeged I had lunch with Nagy, who 
was then 75 years old. Riesz was no longer alive. Nagy 
told me that Riesz had survived the Holocaust by hiding 
in his apartment, where friends brought him food. 

Symbolic Solutions of Partial 
Differential Equations 

Now we must turn to numbers 3a, b, and c, in our
imaginary test questions. Any linear initial-value problem
(the heat equation, the wave equation, and the Schrodinger
equation are the most familiar) can be represented
as $du/dt = Au$, with appropriate choice of $A$, and
so can be solved symbolically by $u(t) = e^{tA}u(0)$.
What is the meaning of $e^{tA}$?

Problem (3a), $du/dt = c\,du/dx$, is the simplest
and easiest. If we represent spatial differentiation by
the symbol $D$, this PDE becomes $du/dt = cDu$.
Returning to our naive innocent mode, we would
just write down $u(t, x) = e^{ctD}u_0(x)$. But in
Section 1 we met Boole's formula, which informs us that
$e^{hD}f(x) = E^h f(x) = f(x + h)$. The group of
operators generated by $cD$ is just shifting to the right
and left at speed $c$, $u(t, x) = u(0, x + ct)$.
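
For a polynomial $f$ the series $e^{hD}f = \sum_n h^n D^n f/n!$ has only finitely many nonzero terms, so the shift formula can be checked exactly. The sketch below stores a polynomial as a list of coefficients, constant term first; that representation, the function names, and the sample polynomial are choices made for this illustration.

```python
from fractions import Fraction

def derivative(coeffs):
    """Coefficients (constant term first) of the derivative of a polynomial."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    return sum(c * x**k for k, c in enumerate(coeffs))

def exp_hD(coeffs, h, x):
    """Sum over n of h^n (D^n f)(x) / n!. The loop stops once D^n f is zero."""
    total = Fraction(0)
    current = list(coeffs)        # coefficients of D^n f, starting with n = 0
    n, n_factorial = 0, 1
    while current:
        total += Fraction(h) ** n * evaluate(current, x) / n_factorial
        current = derivative(current)
        n += 1
        n_factorial *= n
    return total

f = [Fraction(1), Fraction(-2), Fraction(0), Fraction(3)]    # 1 - 2x + 3x^3
shifted = exp_hD(f, h=Fraction(1, 2), x=Fraction(2))          # (e^{hD} f)(2)
direct = evaluate(f, Fraction(2) + Fraction(1, 2))            # f(2 + 1/2)
```

Exact rational arithmetic is used so that the two values can be compared with `==`. This is of course just Taylor's theorem for polynomials in disguise.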

But notice a striking discrepancy. These solution
operators can operate on any $f$, even one with sharp
corners or jumps (a step function). Such irregular
functions cannot satisfy any differential equation. Can
we find a way to make sense of differentiating
non-differentiable functions? (Hint: Yes. Instead of classical
functions, use generalized functions, also known as
Schwartz distributions.)

We can just as well write down the PDE
$du/dt = -c\,du/dx$, with solution
$u(t, x) = u(0, x - ct)$. If we take second
derivatives, we get the same second-order PDE for both shifts,
Problem (3b), $u_{tt} = c^2 u_{xx}$. This is the
one-dimensional wave equation, which describes signals
propagating along a string or a wire at speed $c$. In operator
notation, $u_{tt} = c^2 D^2 u$ or, using operators $D_t$ and
$D_x$, $(D_t^2 - c^2 D_x^2)u = 0$. With operators just as
with numbers, the difference of two squares is easily
factored, to $(D_t + cD_x)(D_t - cD_x)u = 0$.

This factorization can be rewritten in the opposite
order. Each factor annihilates its corresponding shift
operator. $(D_t + cD_x)$ annihilates any function of the
form $f(x - ct)$, and $(D_t - cD_x)$ annihilates any
function of the form $f(x + ct)$, so any linear
combination of a function of $x + ct$ and a function of
$x - ct$ is annihilated by the product of the two
operators. A general solution requires two arbitrary functions
$f$ and $g$, $u(t, x) = f(x + ct) + g(x - ct)$. By
appropriate choice of $f$ and $g$, we can satisfy two
initial conditions, for both the initial value of $u(t, x)$
itself at time $t = 0$, and the initial value of its
derivative $u_t$.

Exercise: Now consider the Cauchy data
$u(0, x) = g(x)$, $u_t(0, x) = h(x)$. Derive
d'Alembert's formula

$$u(t, x) = \frac{1}{2}\{g(x - ct) + g(x + ct)\} + \frac{1}{2c}\int_{x - ct}^{x + ct} h(s)\,ds.$$

The initial value $u(0, x)$ sends half of itself to the
right and half to the left, at speed $c$. To this is added
the integral of the initial velocity $u_t(0, x)$, also sent
to the right and left at speed $c$.
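
The formula can be sanity-checked numerically before it is derived. In the sketch below the integral of the initial velocity is supplied through an antiderivative, and the particular data $g = \sin$, $h = \cos$, the speed $c = 2$, and all function names are assumptions made for illustration. The wave equation is tested with centered finite differences.

```python
import math

def dalembert(g, h_antiderivative, c, t, x):
    """u(t, x) = [g(x - ct) + g(x + ct)] / 2 + [H(x + ct) - H(x - ct)] / (2c),
    where H is any antiderivative of the initial velocity h."""
    left, right = x - c * t, x + c * t
    return 0.5 * (g(left) + g(right)) + (h_antiderivative(right) - h_antiderivative(left)) / (2 * c)

c = 2.0

def u(t, x):
    # g = sin, h = cos, and sin is an antiderivative of cos
    return dalembert(math.sin, math.sin, c, t, x)

def wave_residual(t, x, d=1e-3):
    """Finite-difference estimate of u_tt - c^2 u_xx, which should be near zero."""
    u_tt = (u(t + d, x) - 2 * u(t, x) + u(t - d, x)) / d**2
    u_xx = (u(t, x + d) - 2 * u(t, x) + u(t, x - d)) / d**2
    return u_tt - c**2 * u_xx

def initial_velocity(x, d=1e-5):
    """Finite-difference estimate of u_t(0, x), which should match h(x)."""
    return (u(d, x) - u(-d, x)) / (2 * d)
```

For example, `wave_residual(0.3, 0.7)` is tiny, `u(0.0, 0.7)` equals `math.sin(0.7)`, and `initial_velocity(0.7)` is close to `math.cos(0.7)`.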

In Problem (3c) we meet another important elementary
PDE of evolution, the “heat equation” $u_t = u_{xx}$, which
has the symbolic solution $u(t, x) = \exp(tD^2)$.
We will obtain a solution valid for all real $x$, and for all
positive time. It will model heat flow from an
instantaneous heat pulse concentrated at the origin $x = 0$
at time $t = 0$.

You might know that the Fourier transform turns the
space derivative $D$ into multiplication by a new space
variable. So to interpret $\exp(D^2)$, we might first
consider $\exp(x^2)$. But this unpleasant function has
explosive growth. Better try instead $\exp(-x^2)$. This is
nice. It is a multiple of the familiar bell curve or normal
curve from statistics. One interpretation of the heat
equation (or “diffusion equation” as it is also called) is an
evolution of bell curves, starting with a singular peak
at zero, and flattening out more and more, eventually
going to zero everywhere. In other words, the variance
goes from zero to infinity as time goes from zero to
infinity. We can accomplish this by putting a factor $1/ct$
into the exponent along with $-x^2$. (The coefficient $c$
lets us adjust the rate of decay to fit in with the given
differential equation.)

Thus we have a trial function $\exp(-x^2/ct)$. This
expression goes to 1 as $t$ goes to infinity, pointwise for all
$x$. As $t$ goes to zero, for any $x$ not equal to zero, it goes
to zero, with exponential rapidity, but for $x = 0$, it is
constant $= 1$, independent of time. Physical reasoning
tells us that a positive amount of heat energy initially
concentrated at a single point would have to be very,
very hot. Much more than 1. Also, at all values of $x$,
the temperature should eventually fall to zero, as the
heat energy moves away to the right and left. So we need
a correction factor, some negative power of $t$ that gets
really big as $t$ goes to zero, and gets small as $t$ goes to
infinity. Not knowing in advance what is the right
negative power, we can just call it $a$, and now have as our
trial solution $u = t^a\exp(-x^2/ct)$. “Plug this in”
to the differential equation, calculate $u_t$ and $u_{xx}$, set
them equal and solve for $a$ and $c$. Sure enough, you
will get $c = 4$ and $a = -1/2$.
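
If you would rather let a machine do the “plugging in”, the following sketch estimates $u_t - u_{xx}$ for the trial function by centered finite differences. The function names, the step size, and the sample point are assumptions chosen for illustration; this is a numerical spot check, not a derivation.

```python
import math

def trial(t, x, a=-0.5, c=4.0):
    """Trial solution u = t^a * exp(-x^2 / (c t)), for t > 0."""
    return t**a * math.exp(-x * x / (c * t))

def heat_residual(t, x, a=-0.5, c=4.0, d=1e-3):
    """Finite-difference estimate of u_t - u_xx for the trial solution."""
    u_t = (trial(t + d, x, a, c) - trial(t - d, x, a, c)) / (2 * d)
    u_xx = (trial(t, x + d, a, c) - 2 * trial(t, x, a, c) + trial(t, x - d, a, c)) / d**2
    return u_t - u_xx

good = heat_residual(1.0, 0.5)             # a = -1/2, c = 4
bad = heat_residual(1.0, 0.5, c=2.0)       # same a, wrong c
```

With $a = -1/2$ and $c = 4$ the residual is zero up to discretization error. Changing $c$ to 2 leaves a residual of about 0.33 at the same point, so the check does distinguish the right constants from wrong ones.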

This checks out as a solution to Problem (3c). But you 
may object, “There are many other solutions of this 
equation. What is so special about this one?” In fact, 
this is often called “the fundamental solution.” To see
why, suppose you are given some arbitrary function
$f(x)$ as the initial temperature. What is the
temperature $u(t, x)$ at future times? The equation is linear,
so we can add solutions to get new solutions. Think of
this initial temperature $f(x)$ as the “sum” of separate
contributions, which are each concentrated at a single
point, and zero elsewhere. The solution corresponding
to each one of these isolated initial temperatures is the
fundamental solution we have just found, shifted over
to the particular point in question, and multiplied by an
appropriate factor, according to the value of the initial
temperature $f(x)$ assigned at that point. The “sum” of
all these separate pointwise contributions to the initial
temperature is the integral of the product of the
initial temperature $f(x)$ times the shifted fundamental
solution we have just obtained. This integral of a shifted
product is called the “convolution” of the two functions.
Thus the special solution we have just derived is the seed
from which all other solutions of the initial value
problem are expressed as convolution integrals.

We have solved all three of the last group of test
problems, which came from the advanced subject of
linear partial differential equations. Each of the
differential operators appearing in these initial-value
problems is the “generator” of a “semi-group” of “solution
operators”, which we represented symbolically as the
exponential function of the generator. The generator is
a differential operator. It is the derivative of the solution
operator. These generators and semi-groups operate on
infinite-dimensional function spaces.

Some linear partial differential equations are useful in 
physics. To study them, we must move up to infinite
dimensional spaces. That may sound intimidating. But
after all, we have been talking about $n$-by-$n$ matrices
without restricting $n$. If $n$ is finite but “really really
big”, wouldn't that be close to infinite dimensions? To
put it into language more acceptable in the classroom,
can we do infinite dimensions by approximating from
finite dimensions? Just as we approximate $\pi$ by a finite
decimal expansion, carried out as far as necessary.

In working with infinite-dimensional operators, we
must make new distinctions which did not appear in
the finite-dimensional world of matrix theory. We must
distinguish between bounded and unbounded operators.
The differential operators are unbounded. As such
an operator is iterated, it requires a more and more
restricted function space on which to operate - smoother
and smoother functions. The solution operators, in
contrast, are bounded. This is already evident back in
elementary calculus, where differentiation is unbounded,
but integration is bounded.
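
A small worked example may make the contrast concrete. It assumes the supremum norm, written ‖f‖ for the maximum of |f(x)| over the interval [0, π]; the choice of norm, interval and test functions is an illustrative assumption, not something fixed in the text above.

```latex
% Differentiation D is unbounded: inputs of norm 1, outputs of arbitrarily large norm.
f_n(x) = \sin(nx), \qquad \|f_n\| = 1, \qquad \|D f_n\| = \|n\cos(nx)\| = n \to \infty .

% Integration I is bounded: the output norm is at most a fixed multiple of the input norm.
(If)(x) = \int_0^x f(s)\,ds, \qquad \|If\| \le \pi\,\|f\| .
```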

The axiomatic study of algebras of bounded operators
is included in the highly developed theory of “Banach
Algebras”. This subject is largely the work of the famous
Israel Moiseyevich Gelfand and his collaborators. (They
used the name “normed rings” for their theory, which
later became “Banach algebras”.) The different “functional
calculi”, including infinite series, which we developed
for matrix algebras, are true more generally, for
various other Banach algebras.

In order to understand unbounded operators, such
as the differential operators associated to initial and
boundary value problems, it is necessary to make additional
hypotheses, such as symmetry, or having compact
inverses. The existence of a bounded semi-group
- that is, the solvability of the initial value problem - is
a powerful tool. The Hille-Phillips functional calculus
which we presented for matrices is available in the
infinite-dimensional context. So is the Riesz-Dunford
calculus, based on Cauchy’s integral formula. Diagonalization
works for the important special class of symmetric
compact operators.

The spectra of unbounded operators can be more difficult
to manage. They need no longer be discrete sets of
eigenvalues. But most of the tools we met in studying
matrices are major weapons in dealing with unbounded
operators on infinite-dimensional spaces. Diagonalization,
and the Riesz-Dunford and Hille-Phillips representations,
are chapters in these advanced topics. The facts that
were presented in Section 2, on functions of matrices,
carry over, with due precautions, to the infinite-dimensional
function spaces of linear partial differential equations.
Infinite series work for bounded operators, in
Banach algebras.

Final Methodological Musings 

We have surveyed a handful of different problems, each 
of which is usually expounded in isolation from other 
examples, and hopefully we have learned to see the op- 
erational method in great generality. This honest work 
entitles us to enter the realm of philosophical musing, 
and ask “What are we talking about, when we say cer- 
tain formulas are meaningless, and others are meaning- 
ful or legitimate?” 

If Heaviside’s formulas about D seem “meaningless”,
then to a child in elementary school, the associative
laws of addition and multiplication also seem meaningless.
To the naive beginner, it is the nouns, the numbers,
that are “real”, while the verbs, the operations, are
meaningful only when in actual operation. As Anna Sfard
has emphasized, reification, the move from a verb to a
noun, is intrinsically difficult. Students complain that
the reification of a verb is “too abstract.”

When, with Oliver Heaviside, we write down a meaningless
“fraction,” 1/(2 − D), or when, with a 5-by-5
array of numbers called A, we write down e^A, we are
attempting to extend the domain of definition of division
or exponentiation. And we want the extended function
to still be “the same” function in the new domain,
to be “just like it was” in the old domain. What does that
“mean”? It means Peacock’s principle of “permanence of
equivalent forms”. The extended function should still
satisfy “the same” conditions and formulas. For the
reciprocal function, for example, it means that “the
reciprocal of a product is the product of the reciprocals”.
For the exponential, it means that “the exponential of a
sum is the product of the exponentials”. Earlier in your
schooling, you went through this same process of
enlarging a domain. In the 3rd or 4th grade you started to
“learn fractions.” It’s easy to understand adding
fractions. However, you also have to multiply them!

Dividing by 2 becomes “multiplying” by ½. Until now,
multiplying was just multiplying by a natural number,
which meant adding the same thing up a certain number
of times. Multiplying by a fraction? That doesn’t
“mean anything”. Yes of course, we say “half of six is
three”, but now we change that to “one half times six
equals three.” How does “of” get changed to “times”?

These worries are not discussed in class. Just copy the
teacher and the textbook, until you get used to it. Looking
back on that now, we see that the “times” operation,
multiplication, was being extended from an old domain,
the natural numbers, to a new one, the rational numbers.
The rules for multiplying by fractions are compelled
to be what they are, in order to preserve the existing
structure, to make the rules for natural numbers
continue to be valid for fractions. Peacock’s principle is
not a lucky surprise, it is the guiding principle by which
we carry out the extension.

When we meet matrix multiplication, it is certainly 
clear that the “product” of two square matrices is not 
the same thing as ordinary numerical multiplication. 
By what right do we call it by the same name, “times” 
or product? In other words, how do we know that it is 
an extension of ordinary multiplication, that it follows 
all the same rules? In fact, at the beginning of the line- 
ar algebra course, it is necessary to check that matrix 
multiplication is associative, and distributive (not com- 
mutative). 
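
The check just described can be tried out in a few lines of Python on one particular triple of 2-by-2 matrices. The matrices A, B, C and the helper names matmul and matadd are illustrative choices; a numerical check on one example illustrates the laws and is of course not a proof.

```python
def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]

def matadd(a, b):
    """Entrywise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
C = [[2, 0], [1, 3]]

# (AB)C == A(BC): the associative law survives the extension.
associative = matmul(matmul(A, B), C) == matmul(A, matmul(B, C))
# A(B + C) == AB + AC: so does the distributive law.
distributive = matmul(A, matadd(B, C)) == matadd(matmul(A, B), matmul(A, C))
# AB == BA fails for this pair: commutativity is the rule that is lost.
commutative = matmul(A, B) == matmul(B, A)

print(associative, distributive, commutative)
```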

The process of extending domains is overt in a complex 
variables course. The zeta function is first defined by 
a series convergent in the right half-plane, and then
extended by means of a functional equation that the 
function satisfies. When we extended the exponential 
function to the solution operator of a well posed ini- 
tial-value problem, we did have to prove that this ex- 
tended function satisfies the multiplicative property of 
the exponential. Fortunately, the proof is just a few lines, 
given the status of the solution operator. 

Theorem 2: Let $P[f_1(z), \ldots, f_n(z)]$ be a polynomial
in the n variables $f_j(z)$, where the $f_j(z)$ are
analytic functions of z in some domain that contains
some interval $a < x < b$ of the x axis. If
on the interval those functions satisfy an identity
$P[f_1(x), \ldots, f_n(x)] = 0$, $a < x < b$,
then for all z in the domain it is true that
$P[f_1(z), \ldots, f_n(z)] = 0$. (From Churchill,
Complex Variables and Applications, McGraw-Hill,
1966, page 262.)

While this theorem seems to be quite general, and suffices
for the examples in Churchill’s book, we need to
also allow division in order to include the resolvent
function. But in order to include division, we must impose
an appropriate condition to avoid dividing by zero.
Not only multiplication and division of functions, but
also composition of functions observes permanence.
But the principle breaks down for square roots and other
multi-valued functions.


In elementary ODEs, we freely write p(D) to mean a
polynomial with real coefficients, with, as a “variable”,
the differentiation operator. “Multiplication” means
successive application of a linear operator. We have
extended the domain of the polynomial function from
numbers to differential operators, because the differential
operators follow the same rules as numbers. There is
an amusing reversal of role here!

Differential operators are things that operate on functions.
D doesn’t “mean” anything by itself, it only
“means” something when you give it a function f to
operate on, Df. On the other hand, for some f’s,
in particular for polynomial functions p(x), we also
write down and understand the result of doing it in
the opposite order. p(·) transforms one operator D
to another one, p(D). Instead of operators operating
on functions, we have operators being operated on by
functions.

Two of my own Papers are in 
this Symbolic Method! 

In conclusion, I make a confession. In writing this very
article, I noticed for the first time that in the 1970s I
employed the very same “operator calculus” or “symbolic
method” that I am now preaching.

The two articles are superficially separate and unrelated.
One is called “The Method of Transmutations.” It shows
how to use the solution of one operator problem to get
the solution to another problem involving the same
operator. The second article, co-authored with Tosio Kato,
introduced an infinite number of highly accurate
approximation methods for linear initial-value problems.
These formulas are now called “rational approximations
to semi-groups.” We used rational approximations to
the complex-valued exponential in order to approximate
the operator-valued exponential (“strongly continuous
semi-group”).

A “transmutation” is a transform that takes the solution
to one initial or boundary value problem involving
an operator A, and yields the solution to
a different problem involving that operator. For instance,
we might have $u_t = Au$ and $v_{tt} = A^2 v$, or
$u_{tt} + a u_t + bu = A^2 u$ and $\varepsilon v_{tt} + a v_t = Av$.

This transmutation article is not a normal math research
publication. There is no theorem. Nothing is
claimed to be proved! A collection of formulas, never
previously brought together, are shown to be examples
of the transmutation procedure. These examples show
you how to find a transmutation formula. Just take away
the operator A, and replace it with multiplication by a
real or complex number W. If, as in all these examples,
the solutions to these two “concrete” problems can be
related by a known transform formula, voila! Just plug
in your operator A in place of W, and you have your
transmutation! (This is what we did up above in Section 2
to define an analytic function of a matrix: we
“plugged in” the matrix for the real or complex number
in the standard formula.) Transmutation is a very broad
generalization of the usual applications of the Fourier or
Laplace transform.

A transmutation may turn a singular perturbation
problem into a regular one. Or Problem I could be a
complicated concrete problem from an applied area,
while Problem II could be a standard problem solved in
every introductory textbook. Problem I could involve
confidential or secret data or information, or empirical
data known only approximately. The difficulties or
peculiarities of a singular or empirical operator-theoretical
problem may be taken apart, with the singularities
brought down to real or complex analysis, and
the operator-theoretic complications shifted over to a
known standard problem. Carroll and Showalter used
this transmutation method to study operator equations
generalizing the singular Darboux equation that governs
spherical means.

My second self-reference, the one about “rational
approximation to a semi-group”, came about simply by
noticing that the standard approximation schemes (the
first-order implicit scheme used by Hille and Kato, and
the second-order scheme called “Crank-Nicolson”)
were both merely the simplest examples of approximating
the exponential function by rational functions - ratios
of polynomials - rather than purely by polynomials,
as in Taylor series. Contemplating Hille and Crank-Nicolson,
I wondered, “Why stop there?” In fact, there
is no barrier to cubic or quartic accuracy, or as high as
you like!

At first it may seem laborious to divide by a polynomial.
But no! Heaviside’s elementary trick of partial fractions
saves the day. Just rewrite the reciprocal of the polynomial
divisor as a sum of inverses of linear functions: values of
the “resolvent function”, in other words.
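
For a concrete instance of the partial-fractions trick, take the quadratic divisor (1 − z)(2 − z). This polynomial is a hypothetical choice made only for illustration, and the operator version assumes that 1 and 2 are not in the spectrum of A, so that both inverses exist.

```latex
% Scalar identity: the reciprocal of the divisor is a sum of inverses of linear functions.
\frac{1}{(1-z)(2-z)} = \frac{1}{1-z} - \frac{1}{2-z}

% Plugging in the operator A for z: only resolvents (inverses of first-degree
% expressions in A) ever need to be computed.
\bigl[(1-A)(2-A)\bigr]^{-1} = (1-A)^{-1} - (2-A)^{-1}
```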

The ratios of polynomials, when applied to the first-difference
operator, become combinations of forward and
backward differences, also called “explicit” and “implicit”
differences. The use of implicit or backward differences
corresponds to division instead of multiplication by the
basic difference operator. This is necessary in order to
keep the iterations bounded. This necessary condition
is called “stability” in approximation theory.

For polynomials of degree n in the numerator and degree
m in the denominator, the best possible approximation
to the exponential function is of order n + m,
and is given by the Padé table. I used the Riesz-Dunford
calculus for the proof, but Tosio Kato was able to get
sharper estimates using the Hille-Phillips operational
Laplace transform that I discussed above in Section 2.
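
The claim about orders can be tried out numerically in the scalar case, where the operator is replaced by a single number w (the same “take away the operator” device used for transmutations). The sketch below compares three standard Padé approximants of the exponential: the first-order implicit scheme 1/(1 − z), the Crank-Nicolson ratio, and the (2,2) entry of the Padé table. The test value w = −1, the step counts, and the function names are illustrative assumptions, not anything taken from the papers discussed.

```python
import math

def backward_euler(z):
    """Pade (0,1) approximant: the first-order implicit scheme."""
    return 1.0 / (1.0 - z)

def crank_nicolson(z):
    """Pade (1,1) approximant: second order."""
    return (1.0 + z / 2.0) / (1.0 - z / 2.0)

def pade_2_2(z):
    """Pade (2,2) approximant: fourth order."""
    return (1.0 + z / 2.0 + z * z / 12.0) / (1.0 - z / 2.0 + z * z / 12.0)

def global_error(r, w, steps):
    """Error at time 1 when exp(w) is replaced by r(h w)**steps with h = 1/steps."""
    h = 1.0 / steps
    return abs(r(h * w) ** steps - math.exp(w))

def observed_order(r, w=-1.0, steps=20):
    """Estimate the order from the error ratio when the step size is halved."""
    return math.log2(global_error(r, w, steps) / global_error(r, w, 2 * steps))

for name, r in [("implicit", backward_euler),
                ("Crank-Nicolson", crank_nicolson),
                ("Pade (2,2)", pade_2_2)]:
    print(name, round(observed_order(r), 2))
```

Halving the step should roughly halve, quarter, and divide by sixteen the three errors, in line with orders n + m = 1, 2 and 4.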


Thomée and Brenner then proved an optimal estimate
conjectured by Kato. More recently, Patricio Jara and
others at Louisiana State University extended these results
in several significant ways.

A worked-out computation offered in Jara (2008) is
startling in its orders-of-magnitude superiority over
the standard method. Specifically, the reader is referred
to Figure 2 of that paper, his graphs of the comparative
errors. You will notice that the scales of the two graphs
differ by 5 orders of magnitude!

Ramanujan: A Glimpse of His
Life and His Mathematics

Béla Bollobás


(This article is based on talks given to the Trinity Mathematical Society on 9th June, 1987, and to the Adams Society on 19th January, 1988.)

The history of mathematics is rich in exciting and peculiar geniuses. But none was more extraordinary than Srinivasa
Ramanujan, the great Indian mathematician. In vague terms his romantic but tragic life is quite well known,
but the details are shrouded in mystery even now, and most mathematicians would be hard pressed to state even
one of his theorems. In this brief note we shall sketch the story of his life, including some hitherto less known facts, and
give a short introduction to the kind of mathematics he did.

In the 19th century mathematical analysis in England was in a sorry state: on the Continent, especially in France and
Germany, revolutionary advances were made, while England failed to produce any analyst of first rank. However, by the
1930s England could boast of a flourishing school of analysis, mostly due to the efforts of two outstanding mathematicians,
G.H. Hardy and J.E. Littlewood.

The Hardy-Littlewood partnership, perhaps the greatest scientific partnership ever, dominated English mathematics for
several decades. Hardy, born in 1877, was the older, by 8 years. Their collaboration began in 1911 and their last joint
paper was published in 1948, a year after Hardy’s death. For most of this period they were fellows of Trinity College,
Cambridge.

In 1913 Hardy received an unsolicited letter from India. 

Madras 16th January 1913 

Dear Sir, 

I beg to introduce myself to you as a clerk in the Accounts Department of the Port Trust Office
at Madras on a salary of only £20 per annum. I am now about 23 years of age. I have had no university
education but I have undergone the ordinary school course. After leaving school I have been employing 
the spare time at my disposal to work at Mathematics. I have not trodden through the conventional regular 
course which is followed in a university course, but I am striking out a new path for myself. I have made a 
special investigation of divergent series in general and the results I get are termed by the local mathemati- 
cians as ‘startling’ 

... I would request you to go through the enclosed papers. Being poor, if you are convinced 
that there is anything of value I would like to have my theorems published. I have not given the actual 
investigations nor the expressions that I get but I have indicated the lines on which I proceed. Being
inexperienced I would very highly value any advice you give me. Requesting to be excused for the trouble I 
give you. 

I remain Dear sir Yours truly 

S. Ramanujan 

P.S. My address is S. Ramanujan, Clerk Accounts Department, Port Trust, Madras India. 

The ‘enclosed papers’ contained a long list of results claimed by the writer. Apparently this letter was read by Hardy and
Littlewood after hall. Some of the results were well known and some they could prove, although not too easily, some were
clearly incorrect and some could be proved to be incorrect, but several looked correct and still baffled them entirely. The
entire approach was permeated by a certain naivete, as remarked by Littlewood. “I have a vague theory as to how his
mistakes have come about. I imagine that he is satisfied if he can convince himself that his results are correct & he was
probably staked on certain operations on dgt. [divergent] series involving primes being legitimate. His results are just
about what one wd [would] get if ζ(s) had no zeros in σ > 0.”

Although the basic mistakes committed by this unknown Indian seemed very peculiar to Hardy and Littlewood, the
beauty and strength of many of the other results claimed were indeed startling. To quote Littlewood again, “... this appears
to be correct, though I haven’t verified all the algebra. Not vy [very] exciting of course, though I shd [should] say
rather a tour de force without Cauchy’s theorem.” “17 is an exp [expression] for 0(0, √210) isn’t it? Surely this & 20-23
are new & if so, vy [very] exciting? I can believe that he’s at least a Jacobi. How maddening his letter is in the circumstances.
I rather suspect that he’s afraid that you’ll steal his work.”

As this passage shows, Hardy and Littlewood came to the conclusion that the writer of the letter was not a lunatic, nor 
was he just an amateur, but a man of most unusual talent. To be compared to Jacobi, the great German master of formulae,
was tremendous praise indeed. And to have arrived at that level without any professional training was quite unbelievable. 

So who was the author of this letter? Srinivasa Ramanujan was born into a very poor Brahmin family in 1887, probably 
on 22nd December, in Erode, south of Madras. [This is his accepted date of birth, although it is a little peculiar that he 
himself wrote in January 1913 that he was about 23, and later, in the Admissions Book at Trinity College, he gave his date 
of birth as December 1888.] His father was an accountant and his mother was an intelligent woman, who was literate in 
Tamil and also knew some Hindu astronomy. As he was extremely poor, Ramanujan was educated as a free student in 
the High School at Kumbakonam. 

While in this school, he came across a rather peculiar book, entitled A Synopsis of Elementary Results in Pure Mathematics,
published in two volumes in 1880 and 1886. This was the work of a certain George Shoobridge Carr, a private coach
in London and formerly Scholar of Gonville and Caius College, Cambridge, and contained some 6000 assertions with
abridged demonstrations! Carr was a rather peculiar man himself, who came to Cambridge to read mathematics when
he was nearly forty, and in 1880, the year he published the first volume of his Synopsis, was 12th Senior Optime [12th in
the Second Class]. Nevertheless, the book showed signs of real scholarship and a glowing love of mathematics.

Today, the only claim to fame of Carr’s Synopsis is that it left a profound impression on Ramanujan. Ramanujan set
himself the task of establishing all the formulae in the book and after this mammoth enterprise he never ceased to excel
in finding beautiful and unexpected formulae. This extraordinary feel for formulae and formal manipulations was to be
his main strength.

Ramanujan passed the Matriculation Examination of the University of Madras in 1903, and was awarded a scholarship 
at Kumbakonam. Unfortunately his smooth academic progress did not last long. In 1904 he became temporarily insane, 
left home and for months on end wandered about, getting 700 miles from home. Later he couldn’t remember how he had
lived. In 1907 he sat for the First Examination in Arts but failed in all subjects, including mathematics. Two years later he 
had a very serious operation that kept him in bed for months. But he continued working on mathematics. 

In 1909 Ramanujan married Srimathi Janaki, a 9-year-old girl. In order to support himself and his wife, he became a clerk
in the Port Trust Office in Madras. In his spare time he started genuine research in mathematics, working on elliptic
functions and elliptic integrals, continued fractions and the distribution of primes.

Hardy was not the first man in England approached by Ramanujan. First he wrote to H.F. Baker and then to E.W. Hobson,
but they ignored his letters. In later years Littlewood often chuckled over the embarrassment of his colleagues who
failed to recognise a genius. But Hardy did, and a long, although occasionally strained, correspondence followed. First the
University of Madras granted Ramanujan a special scholarship of £60 per annum for two years at the instance of Sir Gilbert
Walker, Head of the Meteorological Department in India, formerly Fellow and Lecturer in Mathematics at Trinity
College. This sum was sufficient to enable a married Indian to live in tolerable comfort. Later Hardy, and another Trinity
mathematician, E.H. Neville, managed to bring Ramanujan to Cambridge. Ramanujan got a very generous scholarship
of £250 from Madras, of which £50 was allotted to support his family in India, and an exhibition of £60 from Trinity.

Leaving India was a momentous step for Ramanujan: as a Brahmin, he was not supposed to do so. However, eventually
Ramanujan obtained the permission of his mother, got the consent of Namagiri, his favourite Indian goddess, cut his
hair, and on March 17, 1914, in the company of Neville, sailed for England, leaving his young wife behind.

Ramanujan was not very well prepared for Cambridge and arrived at a rather unfortunate time. In many ways, he had
less mathematical sophistication than most undergraduates today. As Littlewood wrote later, “the clear-cut idea of what
is meant by a proof, nowadays so familiar as to be taken for granted, he perhaps did not possess at all; if a significant
piece of reasoning occurred somewhere, and the total mixture of evidence and intuition gave him certainty, he looked no
further.” He hardly knew Cauchy’s Theorem, and practically never used it. Although Ramanujan considered his work on
the classical theory of numbers very important, he knew very little of the general theory of arithmetical forms and even
less about the theory of analytic functions, in particular, about the zeta function of Riemann, upon whose properties the
theory of primes depends. However, in formal manipulations Ramanujan was truly exceptional. To quote Hardy, “It was
his insight into algebraical formulae, transformation of infinite series, and so forth, that was most amazing. On this side
most certainly I have never met his equal, and I can compare him only with Euler or Jacobi. He worked, far more than the
majority of modern mathematicians, by induction from numerical examples; all his congruence properties of partitions,
for example, were discovered in this way.”

Ramanujan’s teaching in Cambridge was to be shared by Hardy and Littlewood, but the War soon broke out and Littlewood
left Cambridge. Ramanujan worked with great enthusiasm, having found a friend and teacher in Hardy, on some
days taking half a dozen new theorems to him. As Hardy wrote, “His mind had hardened to some extent, and he never
became at all an ‘orthodox’ mathematician, but he could still learn to do new things, and do them extremely well. It was
impossible to teach him but he gradually absorbed some new points of view.”

In Cambridge Ramanujan led a very irregular life. As a rather strict Brahmin, he was not only a vegetarian but had to 
do his cooking himself. He worked feverishly, occasionally putting in 30 hours at a stretch and then sleeping for 20. He 
never ceased to be exceptionally productive. 

Unfortunately, in the spring of 1917 Ramanujan began to feel unwell. That summer he went to the Nursing Home in 
Cambridge, and from then on he was hardly ever out of various sanatoria. His illness was never clearly diagnosed: per- 
haps he had tuberculosis, as most commonly believed, or perhaps he had lead poisoning from the food carelessly left 
in his cooking utensils, as asserted by the late Professor P.A.M. Dirac. He was certainly undernourished, suffered from 
the cold and damp climate, and was a bit of a hypochondriac. He went from one sanatorium to another, arriving full of 
hope and trust, becoming disappointed soon and then trying desperately to get away. But wherever he stayed, he did 
mathematics with great enthusiasm. 

One must sympathise with Ramanujan for there is no doubt that he suffered much. His letters to Hardy are almost exclu- 
sively about two topics: mathematics and his health. His longest stay was in the sanatorium at Matlock, where he arrived 
in November, 1917. Let me quote a passage from a letter of Ramanujan to Hardy written from Matlock.

I have been here a month and I have not been allowed fire even for a single day. I have been
shivering from cold many a time and have not been able to take meals sometimes. In the beginning I was
told that I could not possibly have any except the welcome fire I had for an hour or two when I entered this
place. After a fortnight of stay they told me that they received a letter from you about one and promised me
fire on those days in which I do some serious mathematical work. That day hasn’t come yet and I am left in
this dreadfully cold open room. Even if I do any serious mathematics in future I am not going to ask them
for fire on that day.

The bath rooms are nice and warm. I shall go to the bath room with pen and paper every day
for about an hour or so and send you two or three papers very soon. This thought did not strike me before.
Else I would have written something already. In a week or so you may perhaps have a complaint against me
from the doctor that I am having a bath every day. But I assure you beforehand that I am not going to bathe
but to write something.

Although separated from her by several thousand miles, Ramanujan had trouble with his mother as well. Not surprisingly,
she did not realise that her son was a genius, and that a Cambridge degree was far from being a great achievement for 
him. Also, it seems that she disapproved of all communications between Ramanujan and his wife. Here is a passage from 
another letter to Hardy from Matlock. 

It is true that I promised my mother that I was going home at the end of 2 years; I wrote them
several letters 1½ years ago that I was coming over there for the long vacation; but I had many letters of
protest from my mother to the effect that I ought not to come to India till I took my M.A. degree. So I have
given up the idea of going there.

It is not true that I am getting letters from my wife or brothers-in-law or anybody. I had only
very few formal letters from my wife just explaining to me why she had to leave my home and asking me
why my mother should trouble her by [not] speaking to her when they met anywhere by chance. When I do
not know the whereabouts of my wife my mother’s grievance is that I have left my wife in some secret place
somewhere in India and that she is waiting for me frequently to come to that place without my mother’s
knowledge and that I am listening to her words all the time...

The initial S. in my name stands for Srinivasa which is my father’s name. I haven’t got a surname,
really speaking.

In the autumn of 1917 Hardy put Ramanujan up for a Fellowship in Trinity College. However, he was strongly opposed
by some fellows, probably mainly because of the colour of his skin, and failed to get elected. Hardy and Littlewood would
not give in. They put Ramanujan up for a Fellowship of the Royal Society: a considerably higher honour than a Fellowship in
Cambridge. Although this honour was richly deserved by Ramanujan, it was nevertheless a little premature since Littlewood
himself had got elected only a year earlier. In February 1918 Ramanujan was duly elected an F.R.S. and so became
the first Indian member of that illustrious society. As one can see from his letter to Hardy, he was overjoyed at the news.

Dear Mr. Hardy, 

My words are not adequate to express my thanks to you. I did not even dream of the pos- 
sibility of my election. When I opened your telegram I read thrice Fellow Philosophical Society instead of 
Royal Society.... Please convey my heartfelt thanks to Major MacMahon and Mr. Littlewood. 

Later that year Hardy and Littlewood did manage to secure a Fellowship for Ramanujan in Trinity, but the election was
still not straightforward. Many years later Littlewood wrote about the fight concerning the election. “There was much
opposition. Hardy was not made an Elector, and I acted, by letter, because I was quite ill (after concussion on the top
of years without proper holidays from work). I did get wind of the enemy’s tactics from R.A. Herman, who was a close
personal friend, and although he was against Ramanujan himself, he was always naively honest. I said: ‘You can’t reject
an F.R.S.’ ‘Yes, we thought that was a dirty trick!’”

The Fellowships of the Royal Society and Trinity College, together with the financial security the latter entailed, seemed
to restore Ramanujan’s health. On 27 February 1919 Ramanujan sailed for home. Unfortunately the recovery did not last
long. In spite of the more favourable climate and better diet, he soon fell ill again and died on 26 April 1920.

No matter how fascinating and romantic Ramanujan’s life was, his greatness lies in his mathematics. Although many of
Ramanujan’s mathematical achievements are easier to appreciate than the results of most other great 20th century mathematicians,
in a short article we cannot hope to do more than scrape the surface. To illustrate Ramanujan’s unsurpassed
mastery of formulae, we shall give two examples. The first is a very simple identity, while the second is, in fact, a brief
introduction to a subject, the theory of partitions, and leads on to much deeper results.

Let us see then the first example, an expression for the Euler constant γ. Ramanujan gave the following expression for γ:

$$\gamma = \log 2 - 2\left[\frac{1}{3^3-3} + 2\left(\frac{1}{6^3-6} + \frac{1}{9^3-9} + \frac{1}{12^3-12}\right) + 3\left(\frac{1}{15^3-15} + \frac{1}{18^3-18} + \cdots + \frac{1}{39^3-39}\right) + \cdots\right].$$

Much of Ramanujan’s most beautiful work was on the theory of partitions. This is a branch of mathematics that, in its
elementary form, can be appreciated by everybody interested in mathematics. However, at a more advanced level it is
also connected to some deep areas of mathematics, including the theory of modular forms, and has many applications in
analysis, number theory, combinatorics and even physics.

A partition $\lambda = (\lambda_1, \ldots, \lambda_r)$ of $n$ is a non-increasing sequence of positive integers $\lambda_1, \lambda_2, \ldots, \lambda_r$ such that
$\lambda_1 + \lambda_2 + \cdots + \lambda_r = n$. The number of partitions of $n$ is denoted by $p(n)$; we call $p(n)$ the partition function.
Thus $p(1) = 1$, $p(2) = 2$, $p(3) = 3$, $p(4) = 5$, $p(5) = 7$, $p(6) = 11$ and $p(7) = 15$. By convention
one takes $p(0) = 1$.
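
The small values just quoted can be reproduced by listing the partitions directly. The following is only an illustrative sketch; the function names `partitions` and `p` are ours and are not part of the article.

```python
def partitions(n, largest=None):
    """Yield every partition of n as a non-increasing tuple of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()          # the empty partition, which gives p(0) = 1
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def p(n):
    """Count the partitions of n by brute-force enumeration."""
    return sum(1 for _ in partitions(n))


print(list(partitions(4)))
print([p(n) for n in range(8)])
```

Enumeration is hopeless for large $n$, but it is a convenient way of checking identities for the first few values.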

The generating function $f(q)$ for a sequence $a_0, a_1, \ldots$ is the formal power series

\[ f(q) = \sum_{n=0}^{\infty} a_n q^n ; \]

the function $f(q)$ is said to enumerate the sequence $(a_n)$. The power series one tends to consider are convergent for
$|q| < 1$; if so, then the function

\[ \sum_{n=0}^{\infty} a_n q^n , \qquad |q| < 1, \]

determines the sequence $(a_n)$.

Note that we have the following identity of formal power series:

\[ \sum_{n=0}^{\infty} p(n) q^n = \prod_{k=1}^{\infty} \left(1 + q^k + q^{2k} + \cdots\right). \]

Indeed, the coefficient of $q^n$ on the right-hand side is precisely the number of ways we can select non-negative integers
$u_1, u_2, \ldots, u_n$ such that

\[ \sum_{i=1}^{n} i u_i = n, \]

and these selections correspond to partitions of $n$. Every factor on the right is convergent for $|q| < 1$ and so is the
entire product. Hence

\[ \sum_{n=0}^{\infty} p(n) q^n = \prod_{k=1}^{\infty} \frac{1}{1 - q^k} \]

for $|q| < 1$. As all similar identities are claimed to hold for the formal power series and for $|q| < 1$, the condition
$|q| < 1$ is usually omitted, although it is assumed in all relations.
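
As a rough illustration of how the product enumerates partitions, one can multiply out the factors $1/(1 - q^k)$ as truncated power series. This is a sketch only; `partition_numbers` is a name chosen here for illustration.

```python
def partition_numbers(N):
    """Coefficients p(0), ..., p(N) of prod_{k>=1} 1/(1 - q^k), truncated at degree N."""
    coeffs = [1] + [0] * N               # start from the constant series 1
    for k in range(1, N + 1):
        # Multiplying by 1/(1 - q^k) = 1 + q^k + q^(2k) + ... is a running sum
        # taken in steps of k.
        for n in range(k, N + 1):
            coeffs[n] += coeffs[n - k]
    return coeffs


print(partition_numbers(7))
```

Factors with $k > N$ do not affect coefficients up to degree $N$, which is why the truncation is harmless.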

In addition to the partition function $p(n)$ itself, one studies restricted partition functions. For a set $U$ of partitions and
$n \in \mathbb{N}$, let $U(n)$ be the set of partitions of $n$ that belong to $U$ and set $p(U, n) = |U(n)|$. We shall consider the
restricted partition function $p(U, n)$ for a number of sets $U$. For example, $O$ is the set of partitions into odd parts, $D$ is the
set of partitions into distinct parts, $P_o$ is the set of partitions into an odd number of parts, $P_e$ is the set of partitions
into an even number of parts, $D_o$ is the set of partitions into an odd number of distinct parts, and $D_e$ is the set of
partitions into an even number of distinct parts. Thus $\lambda = (\lambda_i)_1^r$ belongs to $O$ if each $\lambda_i$ is odd and it belongs to $D$
if $\lambda_1 > \lambda_2 > \cdots > \lambda_r$.

It is often helpful to represent partitions by their diagrams or graphs (see Figure 1). These diagrams, called Ferrars graphs
or diagrams, were invented by N.M. Ferrars and first appeared in print in a paper by J.J. Sylvester in 1853. It is often
convenient to identify a partition with its Ferrars diagram.

The great Swiss mathematician Leonhard Euler started the study of partitions and the basic results were proved by him
around 1740 and by Carl Gustav Jacob Jacobi about a hundred years later. Many of these results are fascinating and
surprising identities of the kind $p(U, n) = p(U', n)$ for all $n$, where $U$ and $U'$ are certain sets of partitions.

There are two natural ways open to us in proving a partition function identity: we may use function theory, relying on
the generating functions of the partitions, or we may use combinatorics, relying on the Ferrars diagrams of the partitions.
We shall illustrate both approaches in proving some of the classical identities.

Theorem 1: The number of partitions of $n$ into exactly $m$ parts is equal to the number of partitions of $n$ in which the
largest part is $m$. The number of partitions of $n$ into at most $m$ parts is equal to the number of partitions of $n$ into
parts not greater than $m$.

Proof: Given a partition $\lambda = (\lambda_i)_1^r$ of $n$, let $S(\lambda)$ be the partition of $n$ whose Ferrars diagram is obtained from the
Ferrars diagram of $\lambda$ by reflecting it in the main diagonal as illustrated in Figure 2.

It is immediate that $S$ sets up a 1-1 correspondence between the appropriate sets: $\lambda$ has exactly $m$ parts if and only if
$m$ is the largest part of $S(\lambda)$.
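
The reflection in the proof is easy to carry out on a tuple of parts. The sketch below uses our own helper names (`partitions`, `conjugate`) and checks the first statement of Theorem 1 for the arbitrarily chosen values $n = 8$, $m = 3$.

```python
def partitions(n, largest=None):
    """Yield every partition of n as a non-increasing tuple of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def conjugate(parts):
    """Reflect the Ferrars diagram of a partition in its main diagonal."""
    if not parts:
        return ()
    # Column j of the diagram contains one point for every part longer than j.
    return tuple(sum(1 for part in parts if part > j) for j in range(parts[0]))


n, m = 8, 3
exactly_m_parts = [lam for lam in partitions(n) if len(lam) == m]
largest_part_m = [lam for lam in partitions(n) if lam[0] == m]
print(conjugate((5, 3, 2)))
print(sorted(conjugate(lam) for lam in exactly_m_parts) == sorted(largest_part_m))
```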

Now let us give a simple example of a partition identity whose proof is immediate if we use generating functions. This 
identity was proved by Euler in 1748. 

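Theorem 2: The number of partitions of $n$ into odd parts is equal to the number of partitions of $n$ into distinct parts.

Proof: What this result claims is that $p(O, n) = p(D, n)$ for every $n$. Note that

\[ \sum_{n=0}^{\infty} p(O, n) q^n = \prod_{k=1}^{\infty} \frac{1}{1 - q^{2k-1}} \quad \text{and} \quad \sum_{n=0}^{\infty} p(D, n) q^n = \prod_{k=1}^{\infty} (1 + q^k) = \prod_{k=1}^{\infty} \frac{1 - q^{2k}}{1 - q^k} = \prod_{k=1}^{\infty} \frac{1}{1 - q^{2k-1}}, \]

where the last identity followed by cancelling the factors $1 - q^{2k}$.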
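
Euler's identity can also be checked numerically by expanding the two products as truncated power series. This is a sketch with function names of our own choosing.

```python
def count_odd(N):
    """p(O, n) for n = 0..N: expand the product of 1/(1 - q^k) over odd k."""
    coeffs = [1] + [0] * N
    for k in range(1, N + 1, 2):
        for n in range(k, N + 1):
            coeffs[n] += coeffs[n - k]
    return coeffs


def count_distinct(N):
    """p(D, n) for n = 0..N: expand the product of (1 + q^k) over all k."""
    coeffs = [1] + [0] * N
    for k in range(1, N + 1):
        # Going downwards ensures that each part k is used at most once.
        for n in range(N, k - 1, -1):
            coeffs[n] += coeffs[n - k]
    return coeffs


print(count_odd(12))
print(count_distinct(12))
```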

An identity fundamental in the theory of partitions is Euler's pentagonal number theorem. Just as the triangular
numbers $k(k - 1)/2$ count the number of elements in a triangular array (a Pascal triangle), the pentagonal numbers
$k(3k - 1)/2$ count the number of points in a pentagonal array (see Figure 3).

Figure 3: Representations of $6(6 - 1)/2 = 15$ and $4(12 - 1)/2 = 22$.


As we shall see, the pentagonal number theorem makes it possible to give a useful recurrence relation for $p(n)$.

Theorem 3:

\[ \prod_{n=1}^{\infty} (1 - q^n) = 1 - q - q^2 + q^5 + q^7 - \cdots = \sum_{k=-\infty}^{\infty} (-1)^k q^{k(3k+1)/2}. \]


Proof: Let

\[ \prod_{n=1}^{\infty} (1 - q^n) = \sum_{n=0}^{\infty} c_n q^n. \]

The theorem claims that $c_n = 0$ unless $n$ is $k(3k - 1)/2$ or $k(3k + 1)/2$, in which case it is $(-1)^k$.
What is $c_n$ in terms of partitions? Clearly $c_n = p(D_e, n) - p(D_o, n)$ because

\[ \prod_{n=1}^{\infty} (1 + q^n) = \sum_{n=0}^{\infty} p(D, n) q^n \]

is the generating function for distinct partitions, and a partition of $n$ into $m$ distinct parts contributes
$(-1)^m$ to $c_n$. Thus we need an almost 1-1 correspondence between $D_e(n)$ and $D_o(n)$. We shall obtain
such a correspondence by transforming the Ferrars diagram.

Given $\lambda \in D$, the slope $s(\lambda)$ of the Ferrars diagram of $\lambda$ is the maximal set of endpoints forming an
interval sloping by 45° and containing the endpoint of the first line. The base $b(\lambda)$ is the set of points in the last
line (see Figure 4).

Let us try two operations on (the Ferrars diagram of) a partition $\lambda \in D$. In operation $\beta$ put the base next
to the slope and in operation $\sigma$ move the slope underneath the base. Call $\beta$ legal if $\beta(\lambda) \in D$ and call $\sigma$
legal if $\sigma(\lambda) \in D$ (see Figure 5).

Figure 5: The operations $\beta$ and $\sigma$ on $\lambda = (5, 3, 2)$, with $\beta$ illegal and $\sigma$ legal.


$\beta$ is legal on $\lambda$ if $|b(\lambda)| < |s(\lambda)|$ or if $|b(\lambda)| = |s(\lambda)|$ and $b(\lambda) \cap s(\lambda) = \emptyset$. Similarly, $\sigma$ is legal on $\lambda$
if $|s(\lambda)| < |b(\lambda)| - 1$ or if $|s(\lambda)| = |b(\lambda)| - 1$ and $b(\lambda) \cap s(\lambda) = \emptyset$.

Note that if $\beta$ is legal on $\lambda \in D$ then $\sigma$ is legal on $\beta(\lambda)$ and $\sigma(\beta(\lambda)) = \lambda$. Also, if $\sigma$ is legal on $\lambda \in D$
then $\beta$ is legal on $\sigma(\lambda)$ and $\beta(\sigma(\lambda)) = \lambda$. Furthermore, $\beta$ and $\sigma$ change the number of parts by one
so, if they are legal, they map an element of $D_e$ into an element of $D_o$, and an element of $D_o$ into
an element of $D_e$.

When, then, is neither $\beta$ nor $\sigma$ legal on $\lambda \in D(n)$? If either $|b(\lambda)| = |s(\lambda)|$ and $b(\lambda) \cap s(\lambda) \ne \emptyset$
or $|s(\lambda)| = |b(\lambda)| - 1$ and $b(\lambda) \cap s(\lambda) \ne \emptyset$. Setting $s = |s(\lambda)|$ we see that in the first case we must
have $n = s + (s + 1) + \cdots + (2s - 1) = s(3s - 1)/2$ and $\lambda$ is unique, and in the second case
$n = (s + 1) + (s + 2) + \cdots + 2s = s(3s + 1)/2$ and $\lambda$ is again unique (see Figure 6). Therefore if
$n \ne s(3s \pm 1)/2$ then $c_n = p(D_e, n) - p(D_o, n) = 0$ since a combination of $\beta$ and $\sigma$ sets up a 1-1
correspondence between $D_e(n)$ and $D_o(n)$. Furthermore, if $n = s(3s \pm 1)/2$ then $c_n = (-1)^s$ because the ‘odd man
out’, the exceptional partition $\lambda \in D(n)$ on which neither $\beta$ nor $\sigma$ is legal, belongs to $D_e$ if $s$ is even and to $D_o$
if $s$ is odd.
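
The conclusion of the proof can be tested by brute force: list the partitions of $n$ into distinct parts, add up the signs $(-1)^m$, and compare with the value the theorem predicts. This is only a sketch; it does not implement the operations $\beta$ and $\sigma$ themselves, and the function names are ours.

```python
def distinct_partitions(n, largest=None):
    """Yield the partitions of n into distinct parts, as decreasing tuples."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        # The remaining parts must all be strictly smaller than `first`.
        for rest in distinct_partitions(n - first, first - 1):
            yield (first,) + rest


def c(n):
    """c_n = p(D_e, n) - p(D_o, n), computed directly from the partitions."""
    return sum((-1) ** len(lam) for lam in distinct_partitions(n))


def pentagonal_coefficient(n):
    """Coefficient of q^n predicted by the pentagonal number theorem."""
    k = 0
    while k * (3 * k - 1) // 2 <= n:
        if n in (k * (3 * k - 1) // 2, k * (3 * k + 1) // 2):
            return (-1) ** k
        k += 1
    return 0


print([c(n) for n in range(16)])
```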



Let us rewrite the pentagonal number theorem in a form that gives the promised recurrence formula for $p(n)$.

Corollary 4:

\[ (1 - q - q^2 + q^5 + q^7 - \cdots) \sum_{n=0}^{\infty} p(n) q^n = 1, \]

i.e. $p(n) = p(n - 1) + p(n - 2) - p(n - 5) - p(n - 7) + \cdots$.
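
The recurrence of Corollary 4 translates into a few lines of code. The sketch below (with a function name of our own) fills in a table of $p(n)$, where the terms $p(n - k(3k \mp 1)/2)$ carry the sign $(-1)^{k+1}$.

```python
def partition_table(N):
    """Return [p(0), ..., p(N)] using the pentagonal number recurrence."""
    p = [1] + [0] * N
    for n in range(1, N + 1):
        total = 0
        k = 1
        while k * (3 * k - 1) // 2 <= n:
            sign = 1 if k % 2 == 1 else -1
            total += sign * p[n - k * (3 * k - 1) // 2]
            if k * (3 * k + 1) // 2 <= n:
                total += sign * p[n - k * (3 * k + 1) // 2]
            k += 1
        p[n] = total
    return p


table = partition_table(200)
print(table[:8])
print(table[200])
```

Each value needs only about $\sqrt{n}$ earlier values, which is what makes the recurrence practical for hand computation.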

This recurrence formula enables one to calculate the first few values of $p(n)$ rather rapidly. Major MacMahon, one of
the pioneers of enumerative combinatorics, used this formula to calculate the first 200 values of $p(n)$. Let us state
without proof another classical result, Jacobi's formula.

Theorem 5:

\[ \left\{ \prod_{n=1}^{\infty} (1 - q^n) \right\}^3 = 1 - 3q + 5q^3 - 7q^6 + \cdots = \frac{1}{2} \sum_{k=-\infty}^{\infty} (-1)^k (2k + 1) q^{k(k+1)/2} = \sum_{k=0}^{\infty} (-1)^k (2k + 1) q^{k(k+1)/2}. \]


In 1918, from the sanatorium at Matlock, Ramanujan sent a letter to Hardy. “Dear Mr. Hardy, Will you please verify from
the table of $p(n)$ ($\sum p(n) x^n = (1 - x)^{-1} (1 - x^2)^{-1} \cdots$) the following?

(i) $p(4), p(9), p(14), p(19), \ldots$ (comm. diff. 5) are all divisible by 5,

(ii) $p(5), p(12), p(19), p(26), \ldots$ (comm. diff. 7) are all divisible by 7,

(iii) $p(6), p(17), p(28), p(39), \ldots$ (comm. diff. 11) are all divisible by 11.

If these are true (especially (iii)) then ...”
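
With a machine, Ramanujan's request to Hardy takes a moment. The following sketch checks the three statements of the letter for all arguments up to 1000; the bound 1000 and the function name are arbitrary choices of ours.

```python
def partition_numbers(N):
    """Coefficients p(0), ..., p(N) of prod_{k>=1} 1/(1 - q^k)."""
    coeffs = [1] + [0] * N
    for k in range(1, N + 1):
        for n in range(k, N + 1):
            coeffs[n] += coeffs[n - k]
    return coeffs


p = partition_numbers(1000)
results = {}
for modulus, start in ((5, 4), (7, 5), (11, 6)):
    # start, start + modulus, start + 2 * modulus, ... as in (i), (ii), (iii)
    results[modulus] = all(p[n] % modulus == 0 for n in range(start, 1001, modulus))
print(results)
print(p[4], p[9], p[14], p[19])
```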


The conjecture that followed, called, rather characteristically, a theorem by Ramanujan, did not get into the monumental
partition paper Hardy and Ramanujan were writing at the time. However, Ramanujan did publish a proof of various
results supporting this conjecture. To present a proof of the simplest of these results, we need a lemma. Let us write
$\sum a_n q^n \equiv \sum b_n q^n \pmod{m}$ if $a_n \equiv b_n \pmod{m}$ for every $n$.

Lemma 6:

\[ \frac{1}{(1 - q)^5} \equiv \frac{1}{1 - q^5} \pmod{5}. \]


Proof: The result is immediate from the Binomial Theorem. 


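Theorem 7: $p(5m + 4) \equiv 0 \pmod{5}$.

Proof: Note that, by Jacobi's formula and the pentagonal number theorem,

\[ q \prod_{n=1}^{\infty} (1 - q^n)^4 = q \prod_{n=1}^{\infty} (1 - q^n)^3 \prod_{n=1}^{\infty} (1 - q^n) = \sum_{k=0}^{\infty} \sum_{l=-\infty}^{\infty} (-1)^{k+l} (2k + 1) q^{1 + k(k+1)/2 + l(3l+1)/2}. \]

When is an exponent $\gamma(k, l) = 1 + k(k + 1)/2 + l(3l + 1)/2$ divisible by 5? Precisely when
$8\gamma(k, l) - 10l^2 - 5 = (2k + 1)^2 + 2(l + 1)^2 \equiv 0 \pmod{5}$. Now modulo 5 the first term is congruent
to 0, 1, or $-1$, and the second term is congruent to 0, 2, or $-2$, so $\gamma(k, l) \equiv 0 \pmod{5}$ if, and only if,
$2k + 1 \equiv 0 \pmod{5}$ and $l + 1 \equiv 0 \pmod{5}$. In particular, the coefficient of $q^{5m}$ in

\[ q \prod_{n=1}^{\infty} (1 - q^n)^4 \]

is a multiple of 5. Hence, by Lemma 6 (with $q^n$ in place of $q$), the coefficient of $q^{5m}$ in

\[ q \prod_{n=1}^{\infty} \frac{1 - q^{5n}}{1 - q^n} = q \prod_{n=1}^{\infty} (1 - q^n)^4 \prod_{n=1}^{\infty} \frac{1 - q^{5n}}{(1 - q^n)^5} \]

is also divisible by 5. Write $\sum a_n q^n = q \prod (1 - q^n)^{-1}$. Note that $a_n = p(n - 1)$ so $a_{5m+5} = p(5m + 4)$. We claim that each $a_{5m}$ is also divisible
by 5. Indeed, if $m \ge 1$ were the smallest $m$ for which not $a_{5m} \equiv 0 \pmod{5}$ then the coefficient of $q^{5m}$ in $\left(\sum a_n q^n\right) \prod (1 - q^{5n})$, namely

\[ a_{5m} + \sum_{k=1}^{m-1} e_k a_{5(m-k)}, \]

where $e_k$ denotes the coefficient of $q^{5k}$ in $\prod (1 - q^{5n})$, is not $\equiv 0 \pmod{5}$, which is a contradiction. Hence $p(5m + 4) = a_{5m+5} \equiv 0 \pmod{5}$.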

In fact, Ramanujan stated without proof two remarkable related identities, proved later by Darling and Mordell. The first
of these was selected by Major MacMahon as the most beautiful identity discovered by Ramanujan. One need not be an
expert in the field to appreciate its exotic beauty.


Theorem 8:

\[ \sum_{m=0}^{\infty} p(5m + 4) q^m = 5 \prod_{n=1}^{\infty} \frac{(1 - q^{5n})^5}{(1 - q^n)^6}. \]
The full conjecture of Ramanujan turned out to be false and so it was modified. In 1967 Atkin proved this modified
conjecture.

Theorem 9: If $d = 5^a 7^b 11^c$ and $24l \equiv 1 \pmod{d}$ then $p(l) \equiv 0 \pmod{5^a 7^{\lfloor b/2 \rfloor + 1} 11^c}$.

Note that in Theorem 7 we had $d = 5$ and $l = 5m + 4$.

It is a little strange that perhaps the most famous of all identities discovered by Ramanujan are the Rogers-Ramanujan
identities. According to Hardy, these formulae are “certainly as remarkable as any which even Ramanujan ever wrote
down”. The discovery of these identities had been anticipated by a much less famous mathematician, L.J. Rogers.
Ramanujan discovered these identities some time before 1913. He sent them to Hardy, who was very impressed by them,
but could not prove them. Although Hardy asked several mathematicians about the formulae, nobody he asked could
prove them or find a reference. While in Cambridge, Ramanujan attacked them again, but to no avail.

In the volume of his Harvard lectures (p.91), Hardy recalled what happened next. “The mystery was solved, trebly, in
1917. In that year, Ramanujan, looking through old volumes of the Proceedings of the London Mathematical Society,
came accidentally across Rogers’ papers. I can remember very well his surprise, and the admiration which he expressed
for Rogers’ work. A correspondence followed in the course of which Rogers was led to a considerable simplification of his
original proof. About the same time I. Schur, who was then cut off from England by the war, rediscovered the identities
again.”

What is very peculiar is that Rogers’ reputation rests almost entirely on Ramanujan's rediscovery of the astonishing
identities. Although Rogers proved them in 1894, he lost interest in the field and did not draw Ramanujan's attention
to his work, perhaps because he did not even read Ramanujan's papers. Here are the two Rogers-Ramanujan identities.

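Theorem 10:

\[ \sum_{n=0}^{\infty} \frac{q^{n^2}}{(1 - q)(1 - q^2) \cdots (1 - q^n)} = \prod_{n=0}^{\infty} \frac{1}{(1 - q^{5n+1})(1 - q^{5n+4})}, \]

\[ \sum_{n=0}^{\infty} \frac{q^{n^2+n}}{(1 - q)(1 - q^2) \cdots (1 - q^n)} = \prod_{n=0}^{\infty} \frac{1}{(1 - q^{5n+2})(1 - q^{5n+3})}. \]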
In fact, these identities can also be interpreted as rather natural relations concerning partitions. Let $C_{k,i}$ be the set
of partitions $(\lambda_1, \ldots, \lambda_r)$ such that $\lambda_j \not\equiv 0, \pm i \pmod{2k + 1}$ for all $j$, and let $D_{k,i}$ be the set of partitions
$(\lambda_1, \ldots, \lambda_r)$ such that $\lambda_j - \lambda_{j+k-1} \ge 2$ and at most $i - 1$ of the $\lambda_j$ are 1. Then the first Rogers-Ramanujan
identity states that $p(C_{2,2}, n) = p(D_{2,2}, n)$ for all $n$. Indeed, the right-hand side is precisely

\[ \sum_{n} p(C_{2,2}, n) q^n. \]

What is the left-hand side? Take $\lambda = (\lambda_1, \ldots, \lambda_m) \in D_{2,2}(n)$ so that

\[ n = \sum_{j=1}^{m} \lambda_j. \]

Then $\lambda = (2m - 1, 2m - 3, \ldots, 1) + \mu$, where $\mu$ is a partition of $n - m^2$ into at most $m$ parts. Conversely,
if $\mu$ is any partition of $n - m^2$ into at most $m$ parts then $(2m - 1, 2m - 3, \ldots, 1) + \mu \in D_{2,2}(n)$.

Thus $D_{2,2}$ is enumerated by

\[ \sum_{m=0}^{\infty} \frac{q^{m^2}}{(1 - q)(1 - q^2) \cdots (1 - q^m)}. \]
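
The partition form of the first Rogers-Ramanujan identity is easy to test for small $n$. In the sketch below, the function names are ours: `count_C22` counts partitions into parts congruent to $\pm 1$ modulo 5, and `count_D22` counts partitions whose parts differ by at least 2.

```python
from functools import lru_cache


def count_C22(N):
    """p(C_{2,2}, n) for n = 0..N: all parts are congruent to 1 or 4 modulo 5."""
    coeffs = [1] + [0] * N
    for k in range(1, N + 1):
        if k % 5 in (1, 4):
            for n in range(k, N + 1):
                coeffs[n] += coeffs[n - k]
    return coeffs


@lru_cache(maxsize=None)
def count_D22(n, largest):
    """Partitions of n into parts <= largest with successive parts differing by >= 2."""
    if n == 0:
        return 1
    # Choose the largest part, then force every remaining part to be at least 2 smaller.
    return sum(count_D22(n - first, first - 2)
               for first in range(1, min(n, largest) + 1))


left = [count_D22(n, n) for n in range(31)]
right = count_C22(30)
print(left[:12])
print(left == right)
```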


In 1961 Gordon proved the following generalisation of the Rogers-Ramanujan identities.

Theorem 11: If $1 \le i \le k$ then $p(C_{k,i}, n) = p(D_{k,i}, n)$ for all $n$.


Moreover, in 1981 Garsia and Milne proved a family of Rogers-Ramanujan type identities by using bijections, somewhat
similar to the one used above in the proof of Euler's pentagonal theorem. Needless to say, the proof is much more
complicated.

Let us turn to a rather different question concerning partitions. About how large is $p(n)$ for large values of $n$? It is
rather surprising that this had not been asked before 1917, when Hardy and Ramanujan published the amazing memoir
on this topic we briefly referred to earlier.

The recurrence formula given in Corollary 4 enables us to calculate $p(n)$ rather rapidly, provided we know the previous
values, but it is of rather limited use when looking for an asymptotic formula. The function $p(n)$ is rather unruly; it is
certainly much more complicated than $n!$, so that readers who had to prove the rather simple Stirling approximation of
$n!$ in their first year at university will appreciate the following result of Hardy and Ramanujan:


\[ p(n) \sim (4n\sqrt{3})^{-1} \exp\{\pi (2n/3)^{1/2}\}. \]
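
To get a feeling for the quality of this approximation, one can compare it with exact values. The sketch below uses names and sample values of $n$ chosen by us, and prints the ratio of $p(n)$ to the asymptotic expression.

```python
import math


def partition_numbers(N):
    """Coefficients p(0), ..., p(N) of prod_{k>=1} 1/(1 - q^k)."""
    coeffs = [1] + [0] * N
    for k in range(1, N + 1):
        for n in range(k, N + 1):
            coeffs[n] += coeffs[n - k]
    return coeffs


def hardy_ramanujan(n):
    """The asymptotic expression (4 n sqrt 3)^(-1) exp(pi (2n/3)^(1/2))."""
    return math.exp(math.pi * math.sqrt(2 * n / 3)) / (4 * n * math.sqrt(3))


p = partition_numbers(1000)
ratios = {n: p[n] / hardy_ramanujan(n) for n in (10, 100, 1000)}
for n, ratio in ratios.items():
    print(n, p[n], ratio)
```

The ratio creeps towards 1 only slowly, which gives some idea of how much is hidden in the error term.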


Impressive though this approximation is, it is hardly even the beginning. This formula was known to Ramanujan in 
India. But what about the error term? This is where the fun begins. Hardy and Ramanujan proved the following truly 
astonishing result. 

Theorem 12: $p(n)$ is the nearest integer to

\[ (2\sqrt{2})^{-1} \sum_{q=1}^{\nu} \sqrt{q} A_q(n) \psi_q(n), \qquad (*) \]

where

\[ A_q(n) = \sum_{p} \omega_{p,q} e^{-2np\pi i/q}, \]

the sum being over $p$'s relatively prime to $q$ and less than it, $\omega_{p,q}$ is a certain $24q$-th root of unity
and $\nu$ can be taken to be of the order of $\sqrt{n}$.


In his review of Ramanujan's Collected Papers, Littlewood wrote about the extraordinary collaboration resulting in this
amazing result.

One of Ramanujan's Indian conjectures was that the first term of (*) was a very good approximation to $p(n)$; this was
established without great difficulty. At this stage the $x - 1/24$ was represented by a plain $x$ - the distinction is
irrelevant. From this point the real attack begins. The next step in development, not a very great one, was to treat (*) as an
‘asymptotic’ series, of which a fixed number of terms (e.g. $\nu = 4$) were to be taken, the error being of the order of the
next term. But from now to the end Ramanujan always insisted that much more was true than had been established:
‘There must be a formula with error $O(1)$.’ This was his most important contribution; it was both absolutely essential
and most extraordinary. A severe numerical test was now made, which elicited the astonishing facts about $p(100)$ and
$p(200)$. Then $\nu$ was made a function of $n$; this was a very great step, and involved new and deep function-theory methods
that Ramanujan obviously could not have discovered by himself. The complete theorem thus emerged. But the solution
of the final difficulty was probably impossible without one more contribution from Ramanujan, this time a perfectly
characteristic one. As if its analytic difficulties were not enough, the theorem was entrenched also behind almost
impregnable defences of a purely formal kind. The form of the function $\psi_q(n)$ is a kind of indivisible unit; among many
asymptotically equivalent forms it is essential to select exactly the right one. Unless this is done from the outset, and the
$-1/24$ (to say nothing of the $d/dx$) is an extraordinary stroke of formal genius, the complete result can never come
into the picture at all. There is, indeed, a touch of real mystery. If only we knew there was a formula with error $O(1)$, we
might be forced, by slow stages, to the correct form of $\psi_q$. But why was Ramanujan so certain there was one? Theoretical
insight, to be the explanation, had to be of an order hardly to be credited. Yet it is hard to see what numerical instances
could have been available to suggest so strong a result. And unless the form of $\psi_q$ was known already, no numerical
evidence could suggest anything of this kind - there seems no escape, at least, from the conclusion that the discovery
of the correct form was a single stroke of insight. We owe the theorem to a singularly happy collaboration of two men,
of quite unlike gifts, in which each contributed the best, most characteristic, and most fortunate work that was in him.
Ramanujan's genius did have this one opportunity worthy of it.

There are many other topics one should mention in connection with Ramanujan, including elliptic and modular
functions, definite and indefinite integrals, continued fractions, elliptic theta functions, mock-theta functions and related
$q$-series, the representation of numbers as sums of squares, and Ramanujan's $\tau$-function. The last function is defined as
follows:

\[ \sum_{n=0}^{\infty} \tau(n) q^n = q \prod_{n=1}^{\infty} (1 - q^n)^{24}. \]

Ramanujan conjectured that $\tau$ is multiplicative, i.e. $\tau(mn) = \tau(m)\tau(n)$ if $m$ and $n$ are relatively prime, but could
not prove his conjecture. It was proved by L.J. Mordell, and the proof was not too difficult. Since then J.-P. Serre and H.P.F.
Swinnerton-Dyer used $\ell$-adic representations to place the whole area in a general setting, thereby proving wrong Hardy
who, rather pessimistically, wrote that the area may be a backwater of mathematics. A much more important conjecture
of Ramanujan, about the growth of $\tau(n)$, stating that $|\tau(p)| \le 2p^{11/2}$ for every prime $p$, was generalised by
Petersson and became known as the Ramanujan-Petersson conjecture. This was shown to be true by P. Deligne, when he
proved the generalised Weil conjecture on algebraic varieties over finite fields.
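
The first few values of $\tau(n)$ can be read off by multiplying out the product, and both conjectures can then be tested on them. The following is a sketch with names of our own choosing; the range $n \le 30$ is arbitrary.

```python
def tau_table(N):
    """Return a list t with t[m] = tau(m) for 1 <= m <= N (and t[0] = 0)."""
    # series[d] is the coefficient of q^d in prod (1 - q^n)^24, for d < N.
    series = [1] + [0] * (N - 1)
    for n in range(1, N):
        for _ in range(24):
            # Multiply by (1 - q^n); going downwards keeps the update in place.
            for d in range(N - 1, n - 1, -1):
                series[d] -= series[d - n]
    return [0] + series      # the extra factor q shifts every index by one


def is_prime(m):
    return m >= 2 and all(m % d for d in range(2, int(m ** 0.5) + 1))


tau = tau_table(30)
print(tau[1:8])
print(tau[6] == tau[2] * tau[3], tau[15] == tau[3] * tau[5])
print(all(abs(tau[p]) <= 2 * p ** 5.5 for p in range(2, 31) if is_prime(p)))
```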

During his stay in England, Ramanujan suffered much from his illness and from the climate. Nevertheless, he never 
regretted having made the journey and considered his five years in England as the greatest experience of his life. He 
published 21 papers in Europe, five of them in collaboration with Hardy. 

A considerable body of Ramanujan's results is contained in three notebooks: one left with Hardy when he returned to
India and two donated to the University of Madras upon his death. In their style these notebooks resemble the Synopsis of
Carr read by Ramanujan with such enthusiasm. Professor B.C. Berndt has undertaken the task of editing these volumes:
a difficult and arduous task indeed, since he intends to prove each of Ramanujan's theorems. Yet another notebook (in
fact, about 100 loose sheets), named the ‘lost notebook’ of Ramanujan by Professor G.E. Andrews, contains Ramanujan's
brilliant work on mock-theta functions. (The name ‘lost notebook’ is somewhat misleading since Andrews found it in the
Wren Library of Trinity College, catalogued under ‘Ramanujan’.) Ramanujan's sketchy results on mock-theta functions
are among his most profound contributions to mathematics. The mathematical community owes a great debt to
Professors Berndt and Andrews for this important work on these notebooks.

How great a mathematician was Ramanujan? Hardy thought that he could have been even greater: “The tragedy of
Ramanujan was not that he died young, but that, during his five unfortunate years, his genius was misdirected, sidetracked,
and to a certain extent distorted.” This is, of course, perfectly possible. However, it is also possible that Ramanujan did
find the field he was most suited for, in which he could make full use of his extraordinary formal powers. As Littlewood
said, “every positive integer was one of Ramanujan's personal friends”.

Ramanujan's amazing affinity to natural numbers is clearly illustrated by Hardy's well known and charming story about
the taxicab. “I remember going to see him once when he was lying ill in Putney. I had ridden in taxicab No. 1729, and
remarked that the number seemed a rather dull one, and that I hoped that it was not an unfavourable omen. ‘No,’ he
replied, ‘it is a very interesting number; it is the smallest number expressible as a sum of two cubes in two different ways.’
I asked him, naturally, whether he could tell me the solution of the corresponding problem for fourth powers; and he
replied, after a moment's thought, that he knew no obvious example, and supposed that the first such number must be
very large.”

Hardy was rather fond of peculiar intellectual games like comparing mathematicians to cricketers and making up
various elevens. In his ‘Jerusalem Eleven’ God was the opening batsman and Albert Einstein played at No. 3. Professor Paul
Erdős, the prodigiously productive Hungarian mathematician who got to know Hardy and Littlewood in his early
twenties, recalls Hardy's playful ratings of mathematicians on the basis of talent, the ratings going from 0 to 100. Hardy gave
himself 25, Littlewood 30, Hilbert 80, and Ramanujan 100.

In pure, raw mathematical talent Ramanujan is unsurpassed.

Readers wishing to learn more about Ramanujan's mathematics are advised to start by consulting the following three
volumes.
500 Years of Mathematical Anniversaries

1517

Death of Luca Pacioli, a collaborator of Leonardo da Vinci's, and "The Father of Accounting and Bookkeeping". He was the first person to publish on the double-entry system of book-keeping.

1617

Using one of his many telescopes, Galileo is able to observe the Milky Way. Previously thought to be nebulous, it was in fact, he found, a multitude of stars packed so densely that they appeared from Earth to be clouds. He locates many other distant stars, including the double star Mizar in Ursa Major in 1617.

1617

The first of three volumes in Kepler's Epitome astronomiae Copernicanae is printed. This was to be his most influential work, containing all three laws of planetary motion. However, despite culminating in his ellipse-based system, it did not explain how elliptical orbits could be derived from observational data.

1667

Less than six years after first arriving at the University, Isaac Newton is elected a fellow of Trinity College, Cambridge in June 1667. It was to be only a further two years before he would succeed Isaac Barrow as Lucasian Professor.

1742

Goldbach states his conjecture that every even integer greater than 2 can be expressed as the sum of two primes. Today, it stands as one of the oldest and best-known unsolved problems in number theory, and all of mathematics.

1817

Bernard Bolzano provides the first purely analytic proof of the intermediate value theorem. This important theorem states that if a continuous function, f, with an interval, [a, b], as its domain, takes values f(a) and f(b) at each end of the interval, then it also takes any value between f(a) and f(b) at some point within the interval.

1917

Einstein applies the general theory of relativity to the structure of the universe as a whole. He discovered that the general field equations predicted a universe that was dynamic, either contracting or expanding. As observational evidence for a dynamic universe was not known at the time, Einstein introduced a new term, the cosmological constant, to the field equations, in order to allow the theory to predict a static universe. The modified field equations became known as the Einstein World.

2012

On 4 July 2012, news comes out of CERN's Large Hadron Collider of the discovery of a new particle; physicists suspected that it was the Higgs boson.
The Orthoptic Locus: An Inequality

H. T. Croft


The aim of this note is to give an interesting inequality about shapes in the
plane. Let $K$ be a compact convex set in the plane (say with boundary $S$).
The orthoptic locus $T$ is the locus of points $P$ in the plane at which $K$
subtends a right angle; think of a set-square sliding round with its arms
tangent to $K$. See Figure 1.

Let the area between $S$ and $T$ be $B$, the area of $K$ be $A$. Notice that if
$S$ is a circle then these two areas are exactly equal, since $T$ is then a circle
of radius $\sqrt{2}$ times that of $S$. It is natural to ask if we can make the area $B$
even smaller than $A$.

Theorem: $B \ge A$, with equality only when $S$ is a circle.

Proof: By the orthoptic property we may enclose $K$ in a ‘box’. Let the
distances $a, b, x, y$, all dependent on $\theta$, be as in Figure 2, with $\omega = \omega(\theta)$
the width of $K$ in direction $\theta$. The result follows from this sequence of
inequalities and equations, with explanations of the non-trivial steps below.



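\[
\begin{aligned}
B &= \frac{1}{2} \int_0^{2\pi} a^2 \, d\theta, && \text{(see (1) below)}, \\
  &= \frac{1}{2} \int_0^{2\pi} b^2 \, d\theta, && \text{similarly}, \\
  &= \frac{1}{4} \int_0^{2\pi} (a^2 + b^2) \, d\theta, && \text{averaging}, \\
  &= \frac{1}{4} \int_0^{2\pi} c^2 \, d\theta, && \text{by Pythagoras}, \\
  &= \frac{1}{16} \int_0^{2\pi} (c_1^2 + c_2^2 + c_3^2 + c_4^2) \, d\theta, && \text{by symmetry and averaging}, \\
  &\ge \frac{1}{16} \int_0^{2\pi} (x^2 + y^2) \, d\theta, && \text{(see (2))}, \\
  &\ge \frac{1}{16} \int_0^{2\pi} \left( (\omega(\theta))^2 + (\omega(\theta + \tfrac{1}{2}\pi))^2 \right) d\theta, && \text{(see (3))}, \\
  &\ge \frac{1}{8} \int_0^{2\pi} (\omega(\theta))^2 \, d\theta, && \text{as the } \textstyle\int \text{ is over the whole period}, \\
  &\ge \frac{1}{8} \cdot \frac{1}{2\pi} \left( \int_0^{2\pi} \omega(\theta) \, d\theta \right)^2, && \text{by Cauchy-Schwarz}, \\
  &\ge \frac{1}{8} (2L)^2 / 2\pi, && \text{where } L \text{ is the length of } S \text{ (see (4))}, \\
  &= L^2 / 4\pi, \\
  &\ge A, && \text{by the isoperimetric inequality}.
\end{aligned}
\]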

When K is a disc, we have an equality at each step, and only then for the last step (and some others). 
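
As a sanity check on the chain of inequalities, here is a numerical sketch for an ellipse with semi-axes $a$ and $b$ (these letters are local to the example and are not the distances of Figure 2). It relies on two standard facts that are not part of the note: the orthoptic locus of an ellipse is its director circle of radius $\sqrt{a^2 + b^2}$, and the width of the ellipse in direction $\theta$ is $2\sqrt{a^2 \cos^2\theta + b^2 \sin^2\theta}$.

```python
import math


def ellipse_quantities(a, b, steps=20000):
    """Return (B, (1/8) * integral of w^2, L^2 / (4 pi), A, integral of w, L) for an ellipse."""
    h = 2 * math.pi / steps
    thetas = [i * h for i in range(steps)]
    # Width of the ellipse in direction theta (twice its support function).
    widths = [2 * math.sqrt((a * math.cos(t)) ** 2 + (b * math.sin(t)) ** 2)
              for t in thetas]
    int_w = h * sum(widths)
    int_w2 = h * sum(w * w for w in widths)
    # Perimeter, from the parametrisation (a cos t, b sin t).
    L = h * sum(math.sqrt((a * math.sin(t)) ** 2 + (b * math.cos(t)) ** 2)
                for t in thetas)
    A = math.pi * a * b
    B = math.pi * (a * a + b * b) - A     # director circle minus the ellipse
    return B, int_w2 / 8, L * L / (4 * math.pi), A, int_w, L


B, mean_square_term, isoperimetric_term, A, int_w, L = ellipse_quantities(2.0, 1.0)
print(B, mean_square_term, isoperimetric_term, A)
print(int_w, 2 * L)
```

For the ellipse with semi-axes 2 and 1 the four printed quantities decrease strictly, and for a circle they all coincide, in line with the statement about equality.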

Explanations:

1. When $S$ is sufficiently smooth (say continuously differentiable), the result follows from the usual consideration
of differentials (using: the area of an isosceles triangle with sides $a$ and enclosed (small) angle $d\theta$ is approximately $\tfrac{1}{2} a^2 d\theta$). For those
with, rightly, some concerns when $S$ has straight segments or corners, we can appeal to a limiting argument, the
set of smooth $S$ being dense in the set of all $S$.

2. From the useful, but insufficiently well-known, ‘the sum of the squares on the sides of a (convex) quadrilateral equals
the sum of the squares on the diagonals together with 4 times the square on the segment joining the mid-points
of the diagonals’. Proof by vectors, or complex numbers, or for those of a Euclidean turn of mind, 3 applications
of Apollonius’ median theorem (if $AM$ is a median of $\triangle ABC$ then $AB^2 + AC^2 = 2AM^2 + 2MB^2$).

3. Since $x$, $y$ join points on parallel lines distances $\omega(\theta)$, resp. $\omega(\theta + \tfrac{1}{2}\pi)$ apart.

4. The average of the width $\omega(\theta)$ of a convex body, $\frac{1}{2\pi} \int \omega(\theta) \, d\theta$ = average of its projection on an axis at angle
$\theta$ = $\pi^{-1}$ × length of the whole curve. (Proof by differentials, again, using $\int |\cos\theta| \, d\theta$.)
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Sierpinski Tetrahedron 

Created by repeatedly contracting a tetrahedron to half its height, placing together four
copies of this tetrahedron with corners touching, and then repeating the process.
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Why are Series Musical? 

Blanche Descartes 


Most mathematicians know the theory of the
game of Nim, described in books on
mathematical recreations. But few seem to be
aware of Dr. P. M. Grundy's remarkable generalisation,
published in Eureka, 2, 6-8 (1939). Consider a game $\Gamma$
in which 2 players move alternately, and the last player
wins (moving to a “terminal position”). Define
inductively a function $G(P)$ of the position $P$ ($\Omega(P)$ in
Grundy's paper) as follows

• If P is terminal, G(P) = 0 ; 

• If there are permitted moves from P to Q, 
from P to R, from P to S , and so on, then 
G(P) is the least non-negative integer different 
from all of G(Q ), G(R ), G(S), etc. 

It follows that if $0 \le r < G(P)$ there is a move from
$P$ to some $R$ with $G(R) = r$, but no move to any
position $U$ with $G(U) = G(P)$. If positions $P$
with $G(P) = 0$ are called “safe,” the winning
strategy is to move always to a safe position: either this is
terminal, and wins immediately, or the opponent moves
to an unsafe position and the cycle repeats.

Now imagine the players engaging in a “simultaneous
display” of $k$ games $\Gamma_1, \Gamma_2, \ldots, \Gamma_k$ of this sort, the
rule being that each player in turn makes a move in one
and only one game, or if he cannot move in any game
he loses. Let $P_1, P_2, \ldots, P_k$ be the positions in the
respective games $\Gamma_1, \Gamma_2, \ldots, \Gamma_k$. Then Grundy's
Theorem states that

1. This combined position is safe if and only if $k$
heaps of $G(P_1), G(P_2), \ldots, G(P_k)$ counters
respectively form a safe combination in Nim;
2. More generally, the $G$ function of the combined
position is the “nim-sum” of the separate $G(P_s)$
(i.e. obtained by writing the $G(P_s)$ in the scale of
2 and adding columns mod 2).

For no player can gain any advantage by moving so
as to increase any $G(P_s)$, as the opponent can
restore the status quo. And if only decreases in $G(P_s)$
are considered, the game is identical with Nim, thus
proving assertion (1). Therefore $G(P) = g$ if and
only if the combined position $(P, P')$ is safe, where
$G(P') = g$. From that (2) follows fairly readily.


It follows that we can analyse any such combined game
completely, provided that we can find the $G(P_s)$ for
the component positions. Nim is an example; a heap
$H_x$ of $x$ counters constitutes a component position,
since each player in turn alters one heap only, and
$G(H_x) = x$. Many variants of Nim are similarly
analysed. Less trivial is Grundy's game, in which any one
heap is divided into two unequal (non-empty) parts.
Thus heaps of 1, 2 are terminal, with $G = 0$, a heap
of 3 can only be divided into 2 + 1, which is terminal, so
$G(H_3) = 1$. Generally $G(H_x)$ in Grundy's game
is the least integer $\ge 0$ different from all nim-sums of
$G(H_y)$ and $G(H_{x-y})$, $0 < y < x/2$. The series
goes

x      = 0 1 2 3 4 5 6 7 8 9 10 11 12,
G(H_x) = 0 0 0 1 0 2 1 0 2 1  0  2  1,

continuing with 3, 2, 1, 3, 2, 4, 3, 0, 4, 3, 0, 4, 3, 0, 4,
1, 2, 3, 1, 2, 4, 1, 2, 4, 1, 2, ... . This curious “somewhat
periodic series” seems to be trying to have period 3, but
with jumps continually occurring. Mr. Richard K. Guy
confirmed this up to $x = 300$. He suggested that it
might be played on a piano, taking 0 to be middle C, 1 =
D, 2 = E, etc. The inner meaning then became evident:

[Musical score; the lyric begins “Dividing heaps in different parts ...”]
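
The inductive definition of $G$ is short enough to run. The following sketch (function names are ours) computes $G(H_x)$ for Grundy's game exactly as described: the least non-negative integer different from all nim-sums of $G(H_y)$ and $G(H_{x-y})$ with $0 < y < x/2$. The nim-sum is the bitwise exclusive or.

```python
from functools import lru_cache


def mex(values):
    """Least non-negative integer not contained in `values`."""
    m = 0
    while m in values:
        m += 1
    return m


@lru_cache(maxsize=None)
def grundy(x):
    """G(H_x) for Grundy's game: split a heap of x into two unequal non-empty heaps."""
    # y runs over 1 <= y < x/2; the two new heaps form a combined position,
    # whose G value is the nim-sum (XOR) of the separate values.
    options = {grundy(y) ^ grundy(x - y) for y in range(1, (x + 1) // 2)}
    return mex(options)


def nim_sum(values):
    """G function of a combined position, by Grundy's Theorem."""
    total = 0
    for v in values:
        total ^= v
    return total


series = [grundy(x) for x in range(39)]
print(series)
# Heaps of 5, 6 and 13 counters: the combined position is safe exactly when this is 0.
print(nim_sum(grundy(x) for x in (5, 6, 13)))
```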

Guy also worked with rows $R_x$ of $x$ counters, in which
certain sets of consecutive counters could be extracted
(thus possibly leaving two shorter rows, one each side of
the extracted set). In his “6” game, any one counter can
be removed, except an $R_1$ (= a single counter standing
on its own). The $G(R_x)$ series ($x = 1, 2, \ldots$) is
a waltz (N.B. some notes span two bars):

[Musical score, with the handwritten lyrics: “If I'm alone, all on my own, then I must always stay. But if I touch another such, I may be taken away. And as a boon this little tune shows you the right men to play.”]

But at this point the tune completely broke down. I 
asked Guy if he could think of any reason for that: he 
said, “Yes, an error I made in the calculation.” After cor- 
rection the waltz proceeds: 
[Musical score. The handwritten lyrics are only partly legible in this copy; they begin “This series still quite baffles me. The general term I cannot get.”]


This tries to be periodic with period 26, but jumps keep
appearing. Many other such games give tuneful, somewhat
periodic series, for no evident reason. Guy discovered
two curious exceptions: his “.4” game, remove 1 counter
not at the end of a row, has exact period 34 for $x > 54$,
and Kayles, remove 1 or 2 adjacent counters, has exact
period 12 for $x > 71$. Thus these games have a complete
analysis. But generally it might be helpful to bring
in a professional musician to study number theory. Perhaps
a thorough study of Fermat's Last Theorem will
uncover the Lost Chord. After all, why not?









A Brief History of 
Space-Filling Curves 

Ian Stewart

Emeritus Professor of Mathematics, University of Warwick 


In 1890 mathematicians caught a glimpse of the topological
complications of continuous plane curves.
Giuseppe Peano gave a construction for a continuous
curve passing through every point in the unit square.
Today we see this curve as an early precursor of Benoit
Mandelbrot's notion of a fractal [3, 10]. At the
time, counterexamples of this general kind were called
‘pathological curves’, and many mathematicians reacted
to them the way we usually react to pathology - with fear
and loathing. Later, the profession got used to them and
absorbed the deep topological lessons that they teach us.

I’m going to tell some of their stories, show you the
rather intricate constructions that early pioneers used
to prove their existence, and then try to convince you
there's a much simpler way.

A Bigger Infinity

It all began with Georg Cantor. Cantor is famous for
his introduction of transfinite cardinals - how to count
the number of elements in an infinite set. He famously
proved that some infinities are bigger than others.
More precisely, there's no one-to-one correspondence
(bijection) between the integers and the real numbers.
Seeking a transfinite cardinal larger than that of
the reals, for a time he became convinced that the cardinal
of the plane must be greater than that of a line.
On 5 January 1874 he wrote to Richard Dedekind:

Can a surface (say a square that in- 
cludes the boundary) be uniquely re- 
ferred to a line (say a straight line seg- 
ment that includes the end points) so 
that for every point on the surface there 
is a corresponding point of the line 
and, conversely, for every point of the 
line there is a corresponding point of 
the surface? I think that answering this 
question would be no easy job, despite 
the fact that the answer seems so clear- 
ly to be ‘no’ that proof appears almost 
unnecessary. 

Three years later he wrote again to say he'd been wrong.
He'd found a one-to-one correspondence between the
unit interval $[0, 1] \subset \mathbb{R}$ and $n$-dimensional space
$\mathbb{R}^n$ for any finite $n$. ‘I see it, but I don't believe it!’
he wrote. The main idea is simple: given two points
$x, y \in [0, 1]$, write them in decimals as

$$x = 0.x_1 x_2 x_3 x_4 \ldots,$$

$$y = 0.y_1 y_2 y_3 y_4 \ldots,$$

and map this to

$$0.x_1 y_1 x_2 y_2 x_3 y_3 x_4 y_4 \ldots,$$

by interleaving the decimal places. Some care
needs to be taken with non-unique representation
of a real number as a decimal, such as
$0.500000\ldots = 0.499999\ldots$, but that's easy.
Then you have to set up a bijection between $[0, 1]$ and
$\mathbb{R}$, which is also easy.
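The interleaving step is easy to try on finite digit strings. The following is only an illustrative sketch: it works with truncated decimal expansions given as strings of equal length, ignores the non-uniqueness issue mentioned above, and the function names are assumptions made for this example.

```python
def interleave(x_digits, y_digits):
    """Interleave the decimal digits of x and y: x1 y1 x2 y2 ..."""
    if len(x_digits) != len(y_digits):
        raise ValueError("expected digit strings of equal length")
    return "".join(a + b for a, b in zip(x_digits, y_digits))


def deinterleave(z_digits):
    """Recover the two digit strings from an interleaved one."""
    return z_digits[0::2], z_digits[1::2]


if __name__ == "__main__":
    # x = 0.1415..., y = 0.7182... (digits after the decimal point only)
    z = interleave("1415", "7182")
    print("0." + z)
    print(deinterleave(z))
```

Because `deinterleave` undoes `interleave`, the finite version of the map is visibly one-to-one; the infinite case needs the extra care with repeating 9s that the text describes.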

In 1878 Cantor published some of these results in the
Journal für die reine und angewandte Mathematik,
known informally as ‘Crelle's Journal’ after its famous
editor and founder August Crelle, in a paper that formulated
the notion of a one-to-one correspondence.
He investigated countable sets, and sets that are in
one-to-one correspondence with each other. He discussed
‘dimension’, noted that the one-to-one correspondence
between $[0, 1]$ and the unit square
does not preserve dimension, and emphasised that
the map that he had constructed is not continuous.

Cantor's ideas were controversial: some eminent mathematicians
considered them nonsense, probably because
they were so original that it required imagination
and an open mind to appreciate them. Others, notably
David Hilbert, declared the new area that Cantor had
opened up to be a ‘paradise’. Full recognition of the importance
of Cantor's work came only after his death.

Space-Filling Curves 

Introduce the notation $U = [0, 1]$. Then $U^2$ is the
unit square, and more generally $U^n$ is the unit hypercube in
$\mathbb{R}^n$. Cantor's bijection $U \to U^2$ shows that bijections
don't preserve dimension, but as he emphasised,
his map is discontinuous, so there's no particular
reason to expect dimension to be preserved.

In 1879 Eugen Netto answered one obvious question by
proving that there is no continuous bijection between
these sets, which is trickier than it might seem.

Figure 1: Sierpiński's space-filling curve.

The most significant breakthrough came in 1890, when
Giuseppe Peano set the cat among the pigeons by showing
that there is a continuous map of $U$ onto $U^2$. That
is: there exists a continuous curve whose image fills
the unit square. The curve is not just dense in $U^2$: it
passes through every point. We now call such objects
space-filling curves. They show that our default mental
image of a continuous curve can be distinctly misleading.
Peano's paper has no pictures. He defines the curve
using base-3 expansions of points in the unit interval.
Sagan [18] states that the construction is equivalent to
the geometric one shown in Figure 2 (left). For an explanation
see Jaffer [8]. In 1891 David Hilbert published
another example of a space-filling curve, and included a
picture, essentially as in Figure 2 (right).

Both constructions are fairly complicated: the figures
show an early stage of a recursive process in which
simple polygons are repeatedly replaced by more elaborate
ones. What really matters here is not visible as
such in the picture: how the curves are parametrised by
$t \in [0, 1]$. The default is to use arc-length, scaled to
make the total length 1. A curve in a space $X$ is then
defined to be a continuous map $\gamma : [0, 1] \to X$.

When we pass to the infinite limit of the finite polygons
that are being constructed, we take the limit of these maps.
Drawing just the image $\gamma([0, 1])$ can be misleading.

Many other space-filling curves have since been found.
Moore [13] invented a modification of Hilbert's construction
that gives a closed space-filling curve - one
that returns to its starting point. Sierpiński devised one
that fills a triangle, Figure 1. Pólya found another [17].
Two books on the topic are Bader [1] and Sagan [19].

A simpler construction, also attributed to Peano, is
shown in Figure 3. As drawn, it's not entirely clear how
to trace the curve. There are several ways to draw the
first stage as a continuous polygon. Peitgen, Jürgens, and
Saupe [16] page 384 remark that any of these will suffice.
At the next stage of the construction, each straight line
segment is replaced by a smaller version of the same
looping polygon, traced in any continuous manner, and
so on. The images of these curves all look like Figure 3,
but the paths traced through them vary.

Figure 2: Left: An early stage in a geometric interpretation of Peano's
space-filling curve. Right: An early stage in the construction of
Hilbert's space-filling curve.


Figure 3: Another space-filling curve attributed to Peano. [Figure label: 4 iterations.]

Space-filling curves have applications to computing,
such as storage and retrieval of multidimensional data,
Lawder [9]. The basic idea is that we can traverse a
multidimensional array by following an approximation
to a space-filling curve, reducing the problems to
the one-dimensional case. Another application yields
a quick-and-dirty solution of the travelling salesman
problem: find the shortest route along a road network
that visits a specified list of cities. The idea is to run a
finite approximation to a space-filling curve through the
region containing the cities, order them using the parameter
of that curve, and then visit them in that order
using the shortest linking route. This produces a route
that is usually no more than 25% longer than the optimal
one, Bartholdi [2]. This heuristic approach has been
used to find an efficient route for Meals-on-Wheels in
Atlanta, Georgia; to route the delivery of blood to hospitals;
to target a space-based laser in Ronald Reagan's hypothetical
Strategic Defense Initiative (‘Star Wars’); and
to control an XY plotter for computer graphics, where
time spent moving the pen was reduced by a factor of
twenty. (Today's graphics don't use mechanical plotters,
of course, but similar gadgets exist for other purposes,
including embroidery.)
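The ordering idea behind this heuristic can be sketched in a few lines. The example below is an illustration under stated assumptions, not the procedure used in the cited applications: it uses a finite approximation to Hilbert's curve on a $2^k \times 2^k$ grid, cities are assumed to sit at integer grid cells, and the names `hilbert_index` and `curve_tour` are made up for this sketch.

```python
def hilbert_index(order, x, y):
    """Position of grid cell (x, y) along a Hilbert curve on a 2^order grid."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate or reflect the quadrant so the sub-curve has standard orientation.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def curve_tour(order, cities):
    """Visit the cities in the order in which the curve passes through them."""
    return sorted(cities, key=lambda p: hilbert_index(order, p[0], p[1]))


if __name__ == "__main__":
    cities = [(3, 0), (0, 0), (1, 3), (2, 2), (0, 2)]
    print(curve_tour(2, cities))
```

Sorting by a single integer key is what reduces the two-dimensional problem to a one-dimensional one; the quality of the resulting tour depends on the curve keeping nearby parameter values close together in the plane.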

A Sufficient Condition 

These constructions are very widely cited, but they are
fairly complicated. The simplest is Figure 3. Mandelbrot
[10] page 64 credits a simpler approach to Ernesto
Cesàro, but even his method alternates two different
construction steps and fills a triangle. However, there
is an even simpler construction, which is very natural
once you abandon the idea that the successive stages
of the curve should look ever more wiggly. Wanting
to do that in effect confuses the curve, as a path parametrised
by $t \in U$, with its image. The image can look
quite simple, even though the path that the curve traces
is very wiggly. But people tend to seek complicated images.

I haven’t been able to find this construction in the literature,
but it's so simple and natural that it must surely
be known to experts. Each stage produces a curve
whose image is a dull, boring square grid. That's not as
pretty as all those more popular space-filling curves,
and again it's impossible to infer how the curve wiggles
from a picture of its image. But the pictures are
successive approximations to the final curve anyway.
Their wiggly shape may offer some intuition about why
the limiting curve fills space, but this curve goes for
the jugular, getting the desired result with a minimum
of fuss and bother.

All of the above constructions define the space-filling
curve $\gamma$ as the limit of a sequence of curves
$\gamma_n : U \to X \subset \mathbb{R}^2$. A simple basic theorem gives
sufficient conditions for such a limit to be space-filling,
stated in terms of the approximations $\gamma_n$.

One crucial point is that the sequence should be uniformly
convergent. That is, given any $\varepsilon > 0$ there exists
$N$ such that

$$\|\gamma_m(x) - \gamma_n(x)\| < \varepsilon \quad \text{for all } m, n > N \text{ and } x \in U,$$

where $\|\cdot\|$ is the Euclidean norm. (Actually, any norm
will do here.) This condition implies that for every
$x \in U$ the limit

$$\gamma(x) = \lim_{n \to \infty} \gamma_n(x)$$

exists, and that $\gamma$ is continuous. We may then appeal to
the following purely topological result, proved in most
courses on real analysis.

Theorem: Let $\gamma_n : U \to U^2$ be a sequence of continuous
curves, for $n \in \mathbb{N}$. Suppose that:

(i) The sequence of functions $(\gamma_n)$ is uniformly convergent.

(ii) The union of the images of all $\gamma_n$ is dense in $U^2$.

Then the limit

$$\gamma = \lim_{n \to \infty} \gamma_n$$

is continuous, and its image is $U^2$. So $\gamma$ is a space-filling
curve.

Proof: By condition (i), the limit $\gamma$ exists and is continuous.
Let $z \in U^2$. By condition (ii), for each $m > 0$,
$m \in \mathbb{N}$, there exist $t_m \in [0, 1]$ and $n_m \in \mathbb{N}$ such
that

$$\|\gamma_{n_m}(t_m) - z\| < \frac{1}{m}.$$

The subsequence $\gamma_{n_m}$ also tends uniformly to $\gamma$.
The sequence $(t_m)$ lies in $[0, 1]$, which is closed and
bounded, so it has a convergent subsequence, with limit
$t_0$. By uniform continuity, if $m$ is large enough

$$\|\gamma_{n_m}(t_0) - \gamma(t_0)\| < \frac{1}{m}.$$

So

$$\|\gamma(t_0) - z\| < \frac{2}{m}$$

for all $m$, so $\gamma(t_0) = z$ and $\gamma$ is onto.

A Simpler Construction 

I'll now construct such a sequence $\gamma_n$; let's call its limit
$\gamma$ the T-curve. I'll prescribe it geometrically, but making
the construction precise is routine.

Define $\gamma_0$ so that its image is the boundary
of $U^2$ described anticlockwise, parametrised
proportionally to arc-length;
see top left of Figure 4.

To obtain $\gamma_1$, replace each straight segment of $\gamma_0$
by a T-shaped polygon, as shown at the bottom of
Figure 4. The extra part of the T always extends
to the left of the segment, as oriented by the arrow.
The path $\gamma_1$ goes along the original path to its
midpoint, turns left, turns back on itself to return to the
midpoint, and then continues along the original segment
to its endpoint. If we do this for each of the four
segments of $\gamma_0$ and add the T-shaped paths (whose
ends and starts fit together correctly) we obtain the
path $\gamma_1$ at top right of Figure 4. For clarity, the grey
polygon shows the direction in which $t$ traverses $\gamma_1$.
This path visits the centre of the square four times, but
all segments are treated in the same manner, making it
easier to define the $\gamma_n$ inductively. We now repeat the
same construction on every straight segment of $\gamma_1$ to
obtain $\gamma_2$, and continue inductively to define $\gamma_n$ for
all $n \in \mathbb{N}$.

The image of $\gamma_n$ is a square grid of lines separated
by distance $2^{-n}$. Clearly the union
of these images is dense in $U^2$: it consists
of all $(x, y)$ for which either $x$ or $y$ is of the form
$m 2^{-n}$ for $0 \leq m \leq 2^n$, where $m \in \mathbb{N}$. So condition
(ii) holds, and we can define $\gamma$ as the limit of the
$\gamma_n$ once uniform convergence is established. By considering
the construction step at the bottom
of Figure 4, it is easy to prove that

$$\|\gamma_n(t) - \gamma_{n+1}(t)\| < \sqrt{2} \cdot 2^{-n},$$

from which condition (i), uniform convergence, follows
by summing a geometric series. Condition (ii) is obvious,
as noted above.

Therefore $\gamma$ is a space-filling curve. Like Moore's, it's
closed.
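The construction is simple enough to carry out exactly on a computer. The sketch below is one possible reading of the geometric description, not a transcription of Figure 4: it assumes the stem of each T has half the length of the segment it replaces (which is what makes $\gamma_1$ reach the centre of the square), stores each $\gamma_n$ as a list of vertices with exact rational coordinates, and uses illustrative names throughout.

```python
from fractions import Fraction


def refine(vertices):
    """Replace every segment a -> b by the T-shaped path a, m, p, m, b."""
    out = [vertices[0]]
    for (ax, ay), (bx, by) in zip(vertices, vertices[1:]):
        mx, my = (ax + bx) / 2, (ay + by) / 2
        # Tip of the stem: from the midpoint, half a segment length to the left,
        # where "left" is the direction (dx, dy) rotated anticlockwise to (-dy, dx).
        px, py = mx - (by - ay) / 2, my + (bx - ax) / 2
        out += [(mx, my), (px, py), (mx, my), (bx, by)]
    return out


def t_curve(n):
    """Vertices of the n-th approximation; gamma_0 is the anticlockwise boundary."""
    zero, one = Fraction(0), Fraction(1)
    verts = [(zero, zero), (one, zero), (one, one), (zero, one), (zero, zero)]
    for _ in range(n):
        verts = refine(verts)
    return verts


def evaluate(verts, t):
    """Point at parameter t in [0, 1]; all segments have equal length."""
    segments = len(verts) - 1
    k = min(int(t * segments), segments - 1)
    s = t * segments - k
    (ax, ay), (bx, by) = verts[k], verts[k + 1]
    return (ax + s * (bx - ax), ay + s * (by - ay))


if __name__ == "__main__":
    g1 = t_curve(1)
    print(len(g1) - 1, "segments in gamma_1")
    print(evaluate(g1, Fraction(1, 8)))
```

Since every segment of a given stage has the same length, parametrising proportionally to arc-length amounts to spending equal parameter time on each segment, which is what `evaluate` does.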

Although the image of $\gamma_n$ is a grid, the actual structure
of $\gamma$ becomes increasingly complicated because of
the way it zigzags back and forth. If we write $\gamma_n = (x_n, y_n)$,
where $x_n, y_n : U \to U$ are themselves continuous,
then Figure 5 shows the graphs of $x_0$, $y_0$ and $x_1$, $y_1$.

The slopes of the parts of these graphs that are not horizontal
become increasingly steep as $n$ increases. Nevertheless,
the limit is continuous - because of uniform
convergence, and more intuitively, because the intervals
on which such slopes occur become arbitrarily small
(though there are increasingly many such intervals).

Fractal Structure and Dimension

These recursive constructions of space-filling curves are
examples of fractals, Mandelbrot [10]. That is, they have
structure on all scales of magnification. They are (very
close to) self-similar, that is, made up from smaller copies
of themselves.

Figure 5: Graphs of real and imaginary parts of $\gamma_0$, $\gamma_1$. Grey lines included for comparison purposes.


For example, the T-curve can be split into four pieces,
parametrised by $t$ in the ranges [0, 1/4], [1/4, 1/2],
[1/2, 3/4], and [3/4, 1]. Each piece fills a right-angled
isosceles triangle, joining a side of the unit square to
the middle. And each piece is made from four copies
of itself, half the size. Figure 6 attempts to illustrate this
by distorting the triangle slightly to leave small gaps.
The curve visits the numbered points in the order of the
numbers; note that 3, 6, and 9 are all the same point.

I’m not just claiming that the triangle is self-similar. I'm
claiming that the path the curve traces, with all its wiggles,
is self-similar.

In fractal geometry there's a notion of fractal dimension,
more precisely Hausdorff-Besicovitch dimension. I
won't define it here because it's complicated; see Falconer
[3]. In the self-similar case, the idea is more straightforward.
It takes 2 copies of the unit interval to make an
interval of length 2. It takes $4 = 2^2$ copies of the unit
square to make a square of side 2. It takes $8 = 2^3$ copies
of the unit cube to make a cube of side 2. In general,
it takes $2^d$ copies of the unit $d$-dimensional hypercube
to make a hypercube of side 2, and it takes $k^d$ copies
of the unit $d$-dimensional hypercube to make a hypercube
of side $k$.

Fractals often have non-integer dimension. But in this
case, it takes 4 copies of the T-curve to make one twice
as big, so $4 = 2^d$ and $d = 2$. That's not a great surprise,
because a space-filling curve fills space, and here
the space has dimension 2. Still, it's reassuring that the
numerology works out correctly.
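In the self-similar case the relation "number of copies $= k^d$" can be solved for $d$ by taking logarithms, $d = \log N / \log k$. A minimal sketch of that arithmetic follows; the function name is an assumption of this example, and the Koch curve is brought in only as a familiar comparison, not as something discussed in the article.

```python
import math


def similarity_dimension(copies, scale):
    """Solve copies = scale**d for d."""
    return math.log(copies) / math.log(scale)


if __name__ == "__main__":
    print(similarity_dimension(4, 2))  # T-curve: 4 copies make one twice as big
    print(similarity_dimension(8, 2))  # unit cube: 8 copies make a cube of side 2
    print(similarity_dimension(4, 3))  # Koch curve: 4 copies, each one third the size
```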

Filling Other Shapes 

What other shapes can a curve fill? It's easy to find a
continuous bijection from $U^2$ to a solid circular disc,
and composing with this, we get a curve that fills the
disc. The same trick constructs a curve that fills any
space topologically equivalent to $U^2$, and those can be
fairly complicated. However, there are many other possibilities.
For instance, two solid squares that are joined
by a curve can also be filled by linking two space-filling
curves together.

Hilbert's construction extends to three dimensions,
giving a curve that fills the unit cube $U^3$.
But once again, there's a simpler way to prove that
such a curve exists. The existence of a space-filling
curve means that we can parametrise
$U^2$ by $t \in U$. So we can parametrise $U^3 = U^2 \times U$
by $U \times U$. But this is $U^2$, which can be parametrised
by $U$. If we let $\gamma(t) = (x(t), y(t))$, and then define

$$\theta(t) = (x(x(t)), y(x(t)), y(t)),$$

we get a curve $\theta$ that fills $U^3$. Continuing with this
trick, the curve

$$\phi(t) = (x(x(x(t))), y(x(x(t))), y(x(t)), y(t)),$$

fills $U^4$, and so on. Inductively, there's a continuous
map from $U$ onto $U^n$ for any finite $n$.
Indeed, if $m, n > 0$ are integers, there exists a
continuous map from $U^m$ onto $U^n$, even when
$m < n$.



The last word in such results is the Hahn-Mazurkiewicz
Theorem, proved by Hahn [4, 5] and Mazurkiewicz [11,
12]. This states that a set $S$ is the image of a continuous
map from $U$ to a Hausdorff topological space if and
only if $S$ is compact, connected, locally connected, and
metric. See Hocking and Young [7] page 129. Interestingly,
the proof involves another idea of Cantor's: the
Cantor set. But that's another story.
Forgotten Mathematicians

Maria Santos 

Undergraduate, Murray Edwards 


Many of us have heard about the outstanding
contributions made by the ancient Greeks
and by European mathematicians in the 16th,
17th and 18th centuries. However, there are many other
civilizations that made extremely valuable advancements
in mathematics.

Egyptian Maths 

In around 6000 BCE, the ancient Egyptians settled
along the Nile Valley, due to its fertile land. For both
religious and agricultural reasons, they began to record
the patterns of lunar phases and the seasons. In very
early Egyptian history, the Pharaoh's surveyors used
parts of the body, for example the cubit, to take measurements
of the land and buildings, and consequently a
decimal numeral system was born, based on our ten
fingers.

It is thought that the Egyptians introduced the earliest
fully-developed base 10 numeration system - at least as
early as 2700 BCE; a stroke was used for units, a heel-bone
symbol for the tens, a coil of rope for the hundreds,
etc.

The oldest mathematical text is thought to be the Moscow
Papyrus, which dates back to the 12th dynasty in
2000-1800 BCE. The Moscow Papyrus and the Rhind
Papyrus, which dates from around 1650 BCE, contain
mathematical problems. Most of these are practical, but
a few are posed to teach the manipulation of the number
system without practical application. The Rhind Papyrus
contains evidence of other mathematical knowledge
such as unit fractions, composite and prime numbers,
arithmetic and harmonic means and arithmetic and
geometric series. Furthermore, the ancient Egyptians
were the first civilization to develop and solve quadratic
equations, as revealed by the Berlin Papyrus.

Figure 1: Map of Ancient Egypt

Practical problems of trade and the market led to the
development of fractions, and in particular unit fractions.
They could also be used for simple division
sums. For example, say they needed to divide 3 loaves
among 5 people:

• First, they would divide 2 of the loaves into
thirds;

• then divide the third loaf into fifths;

• they would then divide the third that was left over
into 5 pieces;

• therefore, each person would get 1/3 + 1/5 + 1/15
= 3/5.
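The arithmetic in the loaf example can be checked with exact fractions. The sketch below first verifies the share described in the list above; it then shows, purely for comparison, the later "greedy" way of writing a fraction as a sum of unit fractions. The greedy method is not claimed to be what the Egyptian scribes did, and the function name is an assumption of this example.

```python
from fractions import Fraction

# Each person's share in the loaf-splitting procedure above.
share = Fraction(1, 3) + Fraction(1, 5) + Fraction(1, 15)


def greedy_unit_fractions(q):
    """Denominators of a unit-fraction decomposition of q, largest piece first."""
    denominators = []
    while q > 0:
        d = -(-q.denominator // q.numerator)  # ceiling of 1/q
        denominators.append(d)
        q -= Fraction(1, d)
    return denominators


if __name__ == "__main__":
    print(share)
    print(greedy_unit_fractions(Fraction(3, 5)))
```

The two decompositions differ, which is a reminder that a fraction generally has many representations as a sum of distinct unit fractions.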

Multiplication, on the other hand, was achieved by
repeated doubling. The number one was written at the top
of one column and one of the numbers to be multiplied at
the top of another, and both were doubled again and again.
The resulting rows could then be used as a sort of multiplication
reference table:

• The combination of powers of two which add up to
the other number to be multiplied was isolated;

• the corresponding numbers in the other column add
to give the answer.
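The doubling method described above translates directly into code. This is a minimal sketch for non-negative whole numbers; the function name and the choice of 13 × 21 as an example are illustrative, not taken from a papyrus.

```python
def egyptian_multiply(a, b):
    """Multiply a by b using only doubling, comparison and addition."""
    # Build the two columns: powers of two alongside the doubled multiples of b.
    rows = []
    power, multiple = 1, b
    while power <= a:
        rows.append((power, multiple))
        power, multiple = power * 2, multiple * 2

    # Pick out the powers of two that add up to a, largest first,
    # and add the corresponding entries from the other column.
    total, remaining = 0, a
    for power, multiple in reversed(rows):
        if power <= remaining:
            remaining -= power
            total += multiple
    return total


if __name__ == "__main__":
    # 13 = 8 + 4 + 1, so 13 x 21 = 168 + 84 + 21.
    print(egyptian_multiply(13, 21))
```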
Figure 2: Cubit measurements used by ancient Egyptians

Figure 3: Ancient Egyptian numerals


Babylonian Maths 

The Babylonians lived in a region of Mesopotamia, now 
known as Iraq. 

As with most ancient civilisations, a mathematical system
developed once the civilisation had settled and a
bureaucratic need arose to measure plots of land,
tax individuals and so on.

[Figure: multiplication example]


There is evidence that from around 2600 BCE onwards
the Babylonians produced multiplication and division
tables, tables of squares, square and cube roots and
worked on geometrical exercises and problems involving
long division. Furthermore, later tablets from 1800
to 1600 BCE show topics such as fractions, algebra and
methods for solving linear, quadratic and cubic equations
being tackled. The Babylonian mathematicians
also produced approximations, including one for $\sqrt{2}$ which was
accurate to five decimal places!



The Babylonians used an advanced number system with 
a base of 60, rather than 10. Counting physically in this 
base system was done using 12 knuckles on one hand 
and five fingers on the other. Unlike the other number 
systems used at the time by the Egyptians, Greeks and 
Romans, it was a true place-value system, where the dig- 
its in the left column represented larger powers of 60. 
The numbers 1-59 were given using two symbols that 
were combined in distinct ways. 
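A place-value system in base 60 works just like our decimal one, with each place worth 60 times the one to its right. The sketch below converts whole numbers to and from lists of base-60 digits; representing a numeral as a Python list, and the function names, are assumptions made for illustration.

```python
def to_sexagesimal(n):
    """Base-60 digits of a non-negative integer, most significant first."""
    digits = []
    while True:
        n, r = divmod(n, 60)
        digits.append(r)
        if n == 0:
            break
    return digits[::-1]


def from_sexagesimal(digits):
    """Integer value of a list of base-60 digits, most significant first."""
    value = 0
    for d in digits:
        value = value * 60 + d
    return value


if __name__ == "__main__":
    # 3661 seconds is 1 hour, 1 minute and 1 second.
    print(to_sexagesimal(3661))
    print(from_sexagesimal([1, 1, 1]))
```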

Base 60 was a wise choice of base: it has been
conjectured that the advances the Babylonians made in
mathematics were greatly facilitated by the fact that 60
has such a large number of factors. Indeed, 60 is the smallest
number divisible by all numbers from 1 to 6. The
remnants of this number system can still be seen today;
it was the Babylonians who divided the day into 24
hours, with 60 minutes in an hour and 60 seconds in a
minute.

































Another great mathematical advance by the Babylonians
was the concept of the number zero, which was represented
by a circle character, something that had not
been recognized by the Egyptians, Greeks or Romans.
However, they are not necessarily credited with its discovery
as it was used more as a placeholder, rather than
being a number used in calculations.

Figure: Babylonian numerals





One of the most astonishing aspects of Babylonian
mathematics and their calculating skills was their construction
of tables to facilitate their calculations. For example,
two tablets found near the Euphrates River
give squares of the numbers up to 59 and cubes up to
32. The Babylonians used these tables together with formulae
to make calculations easier. For instance, the following
formula, which needs only a table of squares, was used to
make multiplication easier:

$$ab = \frac{(a + b)^2 - (a - b)^2}{4}$$
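Multiplying with a table of squares can be tried directly. The sketch below assumes the identity $ab = \left((a+b)^2 - (a-b)^2\right)/4$; the size of the lookup table and the names used are choices made for this example, not details of any particular tablet.

```python
# Lookup table of squares, standing in for a clay tablet of squares.
# The range is an assumption: it must reach a + b for the numbers multiplied.
SQUARES = {n: n * n for n in range(120)}


def multiply_via_squares(a, b):
    """Compute a * b from two table lookups, a subtraction and a division by 4."""
    return (SQUARES[a + b] - SQUARES[abs(a - b)]) // 4


if __name__ == "__main__":
    print(multiply_via_squares(17, 23))
```

The division by 4 is always exact, because $(a+b)^2 - (a-b)^2 = 4ab$.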

On a Babylonian clay tablet now held in the
British Museum, the following is written

“4 is the length and 5 the diagonal. What is the breadth ? 
Its size is not known. 

4 times 4 is 16. 

5 times 5 is 25. 

You take 16 from 25 and there remains 9. 

What times what shall I take in order to get 9 ? 

3 times 3 is 9. 

3 is the breadth.” 
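The steps on the tablet amount to solving $\text{breadth}^2 = \text{diagonal}^2 - \text{length}^2$. A short sketch of the same procedure, with a function name chosen for this example and restricted to cases where the answer is a whole number:

```python
import math


def breadth(length, diagonal):
    """Follow the tablet: square both, subtract, then find what squares to the rest."""
    remainder = diagonal * diagonal - length * length
    root = math.isqrt(remainder)
    if root * root != remainder:
        raise ValueError("the breadth is not a whole number")
    return root


if __name__ == "__main__":
    print(breadth(4, 5))
```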

Many claim that the Babylonians may have had an understanding
of Pythagoras' theorem before the Greeks.
This claim is fortified by the fact that the Babylonians'
understanding of quadratics was extensive. However,
there is a lot of controversy over this as, due to the damage
and age of the tablet, interpretations of the writings
vary greatly.

Babylonians were not only fantastic at pure mathematics,
but were also very competent in astronomy and
placed great value on its study. They believed that if one
understood what happened in the skies, one would
know what would happen on Earth, as they were connected.
The Babylonians have recently been in the news
over the deciphering of a clay tablet, by Ossendrijver,
that reveals an early form of integral calculus used to calculate
the path of Jupiter, a technique which was thought
to have been invented in medieval Europe.


Although it was previously believed that Babylonians 
did not use geometry for their astronomical calcula- 
tions, this tablet undeniably shows that the Babylonians 
had geometric understanding, allowing them to develop 
a geometric technique to make arithmetic calculations. 

Mayan Maths 

The Mayan civilisation settled in the region of Central
America from around 200 BCE, and at its peak, it was
one of the most densely populated and culturally dynamic
societies in the world.

The Mayans placed great importance on astronomy and
calendar calculations, resulting in the construction of a
very sophisticated number system. They used a vigesimal
number system (based on base 20, and to some extent
5), which is said to have originated from counting
on fingers and toes. Their numerals consisted of only 3
symbols: a dot representing 1, a bar representing 5 and a
shell representing 0. The fact that the Mayans knew the
value of 0 is incredible, as most of the world's civilisations
at that time had no concept of 0.
These three symbols were used in various combinations
to represent the numbers 1-19. Numbers larger than
19 were represented by the same kind of sequence, but
a vertical place system format was used to show each
power of 20. As addition and subtraction were done by
adding up the dots and bars, even uneducated people
could do simple arithmetic in trade and commerce.



Figure 9: Mayan Empire Map 
[Figure: What is this number?]

The number is 11 × 1 + 1 × 20 + 10 × 20 × 18 = 11 + 20
+ 3600 = 3631.
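The worked example uses 18 × 20 = 360, rather than 20 × 20, as the value of the third place, which is the convention of the Mayan calendar count. The sketch below evaluates a numeral under either convention; listing the digits from the lowest place upwards, and the function names, are assumptions of this example.

```python
def mayan_place_values(count, calendar=True):
    """Values of the first `count` places, lowest first.

    With calendar=True the third place is worth 18 * 20 = 360, as in the
    example above; with calendar=False every place is 20 times the last.
    """
    values = [1]
    for i in range(1, count):
        factor = 18 if (calendar and i == 2) else 20
        values.append(values[-1] * factor)
    return values


def mayan_to_int(digits, calendar=True):
    """Value of a numeral whose digits (each 0..19) are listed lowest place first."""
    places = mayan_place_values(len(digits), calendar)
    return sum(d * v for d, v in zip(digits, places))


if __name__ == "__main__":
    print(mayan_to_int([11, 1, 10]))
    print(mayan_to_int([11, 1, 10], calendar=False))
```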

In Mayan society, mathematics was an extremely important
discipline and appears in Mayan art, such as
wall paintings, where mathematics scribes or mathematicians
can be identified by number scrolls, which trail
from under their arms. Interestingly, the first mathematician
identified in one of these paintings was a female
figure.

Furthermore, despite not having the concept of a fraction,
Mayan mathematicians produced extremely accurate
astronomical observations using no tools other
than sticks. For example, they were able to measure the
length of a solar year to a far higher degree of accuracy
than that used in Europe - their calculations gave an
answer of 365.242 days, compared to the modern value
of 365.242198 - as well as the lunar month - their value
was 29.5308 days, compared to the modern value of
29.53059. That's incredibly accurate!

However, due to their geographical disconnect from
other civilisations, their mathematical discoveries had
no influence on the European and Asian numbering
systems or mathematics.

Indian Maths 

Mathematics owes a huge debt to the extraordinary
contributions made by Indian mathematicians over
many hundreds of years. However, there has been a reluctance
to recognise this.

The earliest expression of mathematical understanding
is linked with the origin of Hinduism, as mathematics
forms an important part of the Sulbasutras, which are
the appendices of the Vedas - the original Hindu scriptures.
They contained geometrical knowledge showing
a development in mathematics, although it was purely
for practical religious purposes. Additionally, there is
evidence of the use of arithmetic operations including
squares, cubes and roots.

The Sulbasutras were composed by Baudhayana 
(around 800 BC), Manava (about 750 BCE), Apastamba 
(about 600 BC) and Katyayana (about 200 BC). 

Before the end of this period in the middle of the 3rd 
century, the Brahmi numerals began to appear. Indian 
mathematicians refined and perfected the numeral sys- 
tem, particularly with the representation of numerals 
and, thanks to its dissemination by medieval Arabic 
mathematicians, they developed into the numerals we 
use today. 
Jainism is a religion and philosophy which was
founded in India around the 6th century BC. The main
topics of Jaina mathematics in around 150 BC were the
theory of numbers, arithmetical operations, operations
with fractions, simple equations, cubic equations, quartic
equations, and permutations and combinations.

Furthermore, Jaina mathematicians, such as Yativrsab- 
ha, recognised five different types of infinities: infinity 
in one direction, two directions, in area, infinite every- 
where and perpetually infinite. 

Mathematical advances were often driven by the study 
of astronomy, as it was the science that required accu- 
rate information about the planets and other heavenly 
bodies. 

Yavanesvara (2nd century AD) is credited with trans- 
lating a Greek astrology text dating from 120 BC, and 
in doing so, he adapted the text to make it work into 
Indian culture, using Hindu images with the Indian 
caste system integrated into his text, thus popularising 
astrology in India. 
Figure 11: Hindu-Arabic Numerals


Aryabhata was also an important mathematician. His 
work was a summary of Jaina mathematics, as well as 
the beginning of the new era for astronomy and math- 
ematics. He headed a research centre for mathematics 
and astronomy, where he set the agenda for research in 
these areas for many centuries to come. 

Living at the beginning of the 7th century AD, Brahmagupta
made major contributions to the development
of the number system with his remarkable work
on negative numbers and zero. The use of zero as
a number that could be used in calculations and mathematical
investigations would revolutionise mathematics.
He established the mathematical rules for using the
number zero (except for division by zero), as well as
establishing negative numbers and the rules for dealing
with them. This was another huge conceptual leap,
which had profound consequences for future mathematics.

He wrote down his concepts using the initials of the 
names of colours to represent unknown in his equa- 
tions; this was one of the earliest intimations of what we 
now know as algebra. 

Additionally, he worked on solving quadratic equations
with two unknowns, something that was not even
considered in the West until a thousand years later,
when Fermat was considering similar problems in 1657.
He dedicated a substantial portion of his work to
geometry, and one of his biggest achievements was
obtaining the formula for the area of a cyclic
quadrilateral, now known as Brahmagupta’s formula, as
well as a celebrated theorem on the diagonals of a
cyclic quadrilateral.
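As an illustration, Brahmagupta’s formula gives the area of a
cyclic quadrilateral with sides a, b, c, d as the square root of
(s - a)(s - b)(s - c)(s - d), where s is half the perimeter. The
short Python sketch below is only a modern restatement (the
function name is made up for this example), checked on a
rectangle, which is always cyclic.

```python
import math

def brahmagupta_area(a, b, c, d):
    """Area of a cyclic quadrilateral with side lengths a, b, c, d."""
    s = (a + b + c + d) / 2  # semiperimeter
    return math.sqrt((s - a) * (s - b) * (s - c) * (s - d))

# A 3-by-4 rectangle is cyclic, so the formula should give area 12.
print(brahmagupta_area(3, 4, 3, 4))
```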

The golden age of Indian mathematics is usually dated
from the 5th to the 12th centuries. In this period,
fundamental advances were made in trigonometry. Indian
mathematicians used sine, cosine and tangent functions
to survey the land around them, navigate the seas and
chart the skies. For example, Indian astronomers used
trigonometry to calculate the relative distances between
the Earth and the Moon and the Earth and the Sun. They
realised that when the Moon is half full and directly
opposite the Sun, then the Sun, Moon and Earth form a
right-angled triangle. They measured the angle as 1/7°,
and their sine tables gave the ratio of the sides of
such a triangle as 400:1, showing that the Sun is 400
times further away from the Earth than the Moon.
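The 400:1 figure can be checked with a modern sine function. The
sketch below assumes the right angle sits at the Moon, so that
the Earth-Sun distance is the hypotenuse and the 1/7° angle is
the one opposite the Earth-Moon side; the function name is
invented for this example.

```python
import math

def distance_ratio(angle_degrees):
    """Hypotenuse divided by the side opposite the given small angle."""
    return 1 / math.sin(math.radians(angle_degrees))

ratio = distance_ratio(1 / 7)
print(round(ratio))  # about 401, close to the 400:1 quoted above
```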

Bhaskara II lived in the 12th century and is considered
one of the most accomplished of India’s mathematicians.
He is credited with explaining that division by zero
- a previously misunderstood calculation - yielded
infinity.



Figure 12: Brahmagupta’s Theorem on Diagonals of a Cyclic
Quadrilateral

He also made important contributions to many differ- 
ent areas of mathematics, and many of his discoveries 
predate similar ones made in Europe by several centu- 
ries. 

The Kerala School of Astronomy and Mathematics was
founded in the late 14th century by Madhava of
Sangamagrama. Madhava also developed an infinite series
approximation for π: by successively adding and
subtracting different odd number fractions to infinity,
he could establish an exact formula for π, a conclusion
that was reached by Leibniz in Europe two centuries
later. Applying this method, he calculated π correct to
13 decimal places! Using this mathematics, he went on to
obtain infinite series expressions for sine, cosine,
tangent and arctangent. Arguably more remarkable,
though, was the fact that he gave estimates of the
correction term, implying that he had an understanding
of the limit nature of the infinite series.
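In modern notation the series reads π/4 = 1 - 1/3 + 1/5 - 1/7 +
..., and on its own it converges very slowly. The sketch below
compares the raw partial sum with a version that adds a
correction of (-1)^n / (4n) after n terms. That particular
correction is the simplest of those usually attributed to the
Kerala school and is used here as an assumption, not as a
reconstruction of Madhava’s own working.

```python
import math

def madhava_pi(n_terms, corrected=False):
    """Estimate pi from the first n_terms of 4 * (1 - 1/3 + 1/5 - ...)."""
    total = sum((-1) ** k / (2 * k + 1) for k in range(n_terms))
    if corrected:
        # Assumed correction term: add (-1)^n / (4n) after n terms.
        total += (-1) ** n_terms / (4 * n_terms)
    return 4 * total

raw = madhava_pi(50)
better = madhava_pi(50, corrected=True)
# The corrected estimate should be far closer to pi than the raw sum.
print(abs(raw - math.pi), abs(better - math.pi))
```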

Furthermore, he made contributions to geometry and
algebra, and laid the foundations for the later
development of calculus and analysis, such as the
differentiation and integration of simple functions. It
is speculated that these may have been transmitted to
Europe via Jesuit missionaries, making it possible that
the later European development of calculus was
influenced by his work, to some extent.

Arabic Maths 

In many ways the mathematics we study today is
stylistically more similar to that of the Arabs than to
that of the Greeks.

When referring to Arab mathematicians, I mean those
who came from the region centred on Iran and Iraq.
However, this area varied with military conquest
during the period. At its greatest, the empire stretched
west through Turkey and North Africa, and reached as far
east as the borders of China.

It is widely believed that after the period in which the
Greeks laid the foundations for modern mathematics,
there was a period of stagnation before the Europeans
took over at the beginning of the 16th century. This
perception is highlighted in statements such as that
made by Duhem in ‘Le Système du Monde’:

“...Arabic science only reproduced the teachings re- 
ceived from Greek science.” 

But this is a major misconception. During the Arabic
period, beginning in the 9th century, there were many
talented mathematicians who made important discoveries.
This period started when Al-Ma'mun established Baghdad
as the new centre of wisdom and learning. A research
institute called ‘Bayt al-Hikma’, which translates to
‘The House of Wisdom’, was created and lasted for more
than 200 years. Al-Ma'mun was also responsible for the
large translation project, which saw the major works of
the Greeks being translated. Furthermore, the scholars
there learned the mathematics of the Babylonians and
the Hindus.

Rather than giving a broad overview on Arabic math- 
ematics, I will detail a few prominent mathematicians 
and their accomplishments. 

Abu Ja'far Muhammad ibn Musa al-Khwarizmi (around 790 -
850): His most important work was presented in his book
‘Al-jabr wa'l-Muqabala’, written in 830, which was
translated into Latin and used in Europe for
generations. In it, he uses the word algebra for the
first time, classifies the solutions of quadratic
equations and gives geometric methods for completing the
square. He also notes six different types of linear and
quadratic equations:

$ax^2 = bx$,

$ax^2 = c$,

$bx = c$,

$ax^2 + bx = c$,

$ax^2 + c = bx$,

$ax^2 = bx + c$.
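To make “completing the square” concrete for the type
$ax^2 + bx = c$ with $a = 1$, the sketch below solves
$x^2 + 10x = 39$, an example often quoted in connection with this
book. The function name and the restriction to positive b and c
are choices made for this illustration only.

```python
import math

def complete_the_square(b, c):
    """Positive root of x^2 + b*x = c, assuming b > 0 and c > 0."""
    half_b = b / 2
    # Adding half_b^2 to both sides gives (x + half_b)^2 = c + half_b^2.
    return math.sqrt(c + half_b ** 2) - half_b

print(complete_the_square(10, 39))  # 3.0
```

Geometrically, the term half_b squared is the small square that
completes the larger one, which is where the method gets its name.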

Abu Kamil Shuja ibn Aslam ibn Muhammad ibn Shuja 
(around 850 - 930): He worked on integer solutions of 
equations, as well as giving the solution of a fourth de- 
gree equation and of quadratic equations with irrational 
coefficients. It is argued that his biggest advance was
his use of irrational coefficients, and his work was the
basis of Fibonacci’s books.

Abu'l-Hasan Thabit ibn Qurra (826 - 901): Thabit
inherited a large family fortune, enabling him to go to
Baghdad and obtain mathematical training. He generalised
Pythagoras’s theorem to an arbitrary triangle and is
regarded as the Arabic equivalent of Pappus - the
commentator on higher mathematics. Additionally, he
founded a school that translated works by Euclid,
Archimedes, Ptolemy and Eutocius (note that Diophantus
and Pappus were unknown to the Arabs until the 10th
century). Many ancient books would have been lost
without his effort.

Mohammad Abu'l-Wafa al-Buzjani (940 - 998): He translated
many works of Euclid, Diophantus - for example
‘Arithmetica’ - and al-Khwarizmi that had been lost.
However, he is most famous for his use of the tangent
function for the first time and for compiling tables of
sines and tangents at 15-arcminute intervals. These
values are accurate to 8 decimal places - an astonishing
achievement considering Ptolemy’s were only accurate to
3 decimal places!

Abu Bakr al-Karaji (early 11th century): More than any
other Arabic mathematician, al-Karaji’s mathematics
pointed in the direction of Renaissance mathematics.
Some of his achievements include giving a numerical
solution to equations of the form:

$ax^{2n} + bx^n = c$.

Furthermore, he proved the following fact using a form
of mathematical induction, meaning that the argument
extends to all positive integers:

$1^3 + 2^3 + \cdots + 10^3 = (1 + 2 + \cdots + 10)^2$.
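A quick numerical check of this identity, for the case n = 10
given above and for a range of other n, might look as follows.
It is a sanity check rather than a proof, and the function names
are invented for the example.

```python
def sum_of_cubes(n):
    """1^3 + 2^3 + ... + n^3."""
    return sum(k ** 3 for k in range(1, n + 1))

def square_of_sum(n):
    """(1 + 2 + ... + n)^2."""
    return sum(range(1, n + 1)) ** 2

# The case n = 10 from the text, then a check for many other n.
print(sum_of_cubes(10), square_of_sum(10))  # 3025 3025
assert all(sum_of_cubes(n) == square_of_sum(n) for n in range(1, 200))
```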

Omar Khayyam (1048 - 1122): His work on algebra was
known throughout Europe in the Middle Ages. As well
as discovering a geometrical method for solving cubic
equations by intersecting a parabola with a circle, he
contributed to a calendar reform and gave important
results on ratios, giving them a new definition to
include the multiplication of ratios.

Ghiyath al-Din Jamshid Mas'ud al-Kashi (1390 - 1450):
Al-Kashi applied a method, now known as fixed-point
iteration, to solve a cubic equation having sin 1° as a
root. He also worked on solutions of systems of
equations and developed a method for finding the nth
root of a number, which is known as Horner’s method
today.
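The idea behind the sin 1° calculation can be sketched with the
triple-angle identity sin 3t = 3 sin t - 4 sin^3 t. Writing
x = sin 1° turns it into the cubic 4x^3 - 3x + sin 3° = 0, which
rearranges to x = (sin 3° + 4x^3) / 3 and can be iterated. The
code below is a modern illustration under the assumption that
sin 3° is already known (here it is simply taken from the math
library), not a reconstruction of al-Kashi’s actual working.

```python
import math

def sin_one_degree(sin_three_degrees, iterations=10):
    """Fixed-point iteration for x = sin(1 degree), given sin(3 degrees)."""
    x = 0.0
    for _ in range(iterations):
        # Rearranged cubic: x = (sin 3 degrees + 4 x^3) / 3
        x = (sin_three_degrees + 4 * x ** 3) / 3
    return x

# Assumption: sin 3 degrees is supplied by the math library here.
approx = sin_one_degree(math.sin(math.radians(3)))
print(approx, math.sin(math.radians(1)))
```

Because x is small, the cubic term changes very little from one
step to the next, so the iteration settles after only a few
rounds.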

Chinese Maths 

It can be said that Chinese mathematics had the longest
period of sustained growth, which lasted from the start
of the Christian era to the 14th century AD. Ancient
Chinese mathematics
witnessed three golden ages during the Western and 
Eastern Han Dynasties, the Wei, Jin and the Northern 
and Southern Dynasties, and finally the Song and Yuan 
Dynasties, where the development of mathematics 
reached its height. 

The simple yet effective ancient Chinese numbering
system, which dates back to the 2nd millennium BCE,
used bamboo rods arranged in different ways to
represent the numbers 1 to 9. They were then placed in
columns to represent the units, tens, hundreds and so
on. The Chinese thus used a decimal place value system
similar to the one we use today, and in fact theirs was
the first number system of this kind. The Chinese
adopted it over a thousand years before the West, and
it made complex calculations very quick and easy.

Figure 13: Chinese Numerals - Two Possible Representations

However, the Chinese had no concept of ‘zero’, which
limited their calculations. Furthermore, although the
abacus is largely considered a Chinese invention, some 
variations of the abacus were being used in Mesopota- 
mia, Egypt and Greece much earlier than in China. 

The Chinese were fascinated by patterns in mathemat- 
ics, and paid particular attention to magic squares as 
they were considered to have great spiritual and reli- 
gious significance. These are squares of numbers where 
each row, column and diagonal add up to the same
total. The oldest known magic square is the Lo Shu
Square, which dates back to 650 BCE. It is a 3×3 square
where each row, column and diagonal add up to 15.
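The magic-square property is easy to verify mechanically. The
sketch below uses the usual arrangement of the Lo Shu Square and
a helper (named here purely for illustration) that collects all
eight line sums.

```python
LO_SHU = [
    [4, 9, 2],
    [3, 5, 7],
    [8, 1, 6],
]

def line_sums(square):
    """Return the sums of every row, every column and both diagonals."""
    n = len(square)
    rows = [sum(row) for row in square]
    cols = [sum(square[r][c] for r in range(n)) for c in range(n)]
    diagonals = [
        sum(square[i][i] for i in range(n)),
        sum(square[i][n - 1 - i] for i in range(n)),
    ]
    return rows + cols + diagonals

print(line_sums(LO_SHU))  # eight sums, all equal to 15
```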


The main difference between Chinese and ancient Greek
mathematics is that whilst the Greeks focused
on proving theorems, the Chinese devoted their time to 
creating efficient calculation methods, especially those 
for solving equations. As a result, they created advanced 
methods for solving systems of linear equations, equa- 
tions of higher degree and indeterminate equations. 

Notable figures in Chinese mathematics include Liu 
Hui, Sun Tzu and Qin Jiushao. 

Figure 14: Lo Shu Square

Liu Hui (around 220 AD): Liu Hui is considered one of
the most famous Chinese mathematicians. He published a
commentary on the book ‘The Nine Chapters on the
Mathematical Art’, which contained the solutions to
mathematical problems that were written in the form of
decimal fractions. He was one of the first
mathematicians known to leave roots unevaluated, giving
more exact results rather than approximations. In his
commentary on the Nine Chapters, he proposed an
algorithm to calculate the value of π by an
approximation using a regular polygon with 192 sides.
Furthermore, he commented on a diagram of a proof
seemingly identical to that of the famous Pythagorean
theorem:

“the relations between the hypotenuse and the sum and 
difference of the other two sides whereby one can find 
the unknown from the known” 
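The polygon approximation of π mentioned above can be sketched
by repeated side-doubling, starting from a hexagon inscribed in
a circle of radius 1. The doubling step and the area rule below
are written in modern notation, and the function name is made up
for this example; it is a sketch of the idea rather than a
transcription of Liu Hui’s text.

```python
import math

def inscribed_polygon_area(target_sides=192):
    """Area of a regular polygon inscribed in a unit circle.

    target_sides must be 6 * 2**k with k >= 1 (12, 24, 48, 96, 192, ...).
    """
    sides, side_length = 6, 1.0  # a hexagon in a unit circle has side 1
    while 2 * sides < target_sides:
        # Side of the polygon with twice as many sides.
        side_length = math.sqrt(2 - math.sqrt(4 - side_length ** 2))
        sides *= 2
    # A polygon with 2 * sides sides has area sides * side_length / 2.
    return sides * side_length / 2

print(inscribed_polygon_area(192))  # roughly 3.1410, just below pi
```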

He was also a pioneer in the field of empirical solid
geometry. For example, he discovered that a wedge with a
rectangular base and both sides sloping could be broken
into a pyramid and a tetrahedral wedge.

Sun Tzu (3rd century CE): Sun Tzu’s biggest contribution
was creating the Chinese Remainder Theorem. It is
considered one of the jewels of mathematics and has been
used in a variety of fields, from the work of Chinese
astronomers in the 6th century AD to Internet
cryptography today. This theorem states that if you know
the remainders of the division of an integer n by
several integers, then you can determine uniquely the
remainder of the division of n by the product of these
integers, provided that the divisors are pairwise
coprime.

Qin Jiushao: The 13th century is considered to be one
of the golden ages of Chinese mathematics, and during
this period there were over 30 prestigious mathematics
schools in China. Although contemporary authors mention
his ambitious and cruel personality, perhaps the most
brilliant Chinese mathematician of this era was Qin
Jiushao. He wrote the book ‘Mathematical Writings in
Nine Sections’, which is divided into nine categories
each containing nine problems. The two most important
methods he created are one for the solution of
simultaneous linear congruences and an algorithm for
obtaining a numerical solution of higher degree
polynomial equations based on a process of successively
better approximations, which was rediscovered in Europe
in the 19th century and is known as the Ruffini-Horner
method.
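Simultaneous linear congruences of the kind described here, in
the pairwise coprime case covered by the Chinese Remainder
Theorem, can be solved in a few lines. The sketch below is a
standard modern construction and is not claimed to be the method
of either Sun Tzu or Qin Jiushao. The worked example, remainders
2, 3 and 2 on division by 3, 5 and 7, is the one traditionally
associated with Sun Tzu.

```python
from math import prod

def chinese_remainder(remainders, moduli):
    """Return n mod prod(moduli), given n mod each pairwise coprime modulus."""
    total_modulus = prod(moduli)
    n = 0
    for r, m in zip(remainders, moduli):
        partial = total_modulus // m
        # pow(partial, -1, m) is the inverse of partial modulo m (Python 3.8+).
        n += r * partial * pow(partial, -1, m)
    return n % total_modulus

# Remainders 2, 3, 2 on division by 3, 5, 7.
print(chinese_remainder([2, 3, 2], [3, 5, 7]))  # 23
```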

These mathematicians were exceptional people who 
shaped modern mathematics with their contributions. 
This makes me wonder how much is yet to be discov- 
ered, and how much mathematics has been lost in the 
course of history.

Hence


Dustcartes 


The word “hence” often crops up in mathematical
writings. The time has now come to reveal the 
true story behind its use. 

In the town of Clangalot, in Medieval England, a crisis 
was arising. King Arthur (no relation to Ben Hur) had 
become so annoyed with knights armed with compass 
and dividers trampling over his table trying to square 
the circle that he was seriously considering changing it 
back to a rectangular one. But Merlin, the court magi- 
cian (the King court him pinching Queen Guinivere), 
had other ideas. He advised Arthur to send the knights
out on long and difficult quests, to keep them out of the
way. Since there were many knights (one for every day
of the year), this was not easy, but with Merlin’s help it
was accomplished.

Now one knight, Sir Chancealot (so called because of 
his practical interest in probability theory), was sent to 
search for a fabulous long-trunked beast, the Eliminant, 
found in the equilateral jungles of darkest (or at any 
rate darker) Africa. As he rode through the forest on 
his milk-white steed (well, you try drinking your dai- 
ly pinta on the back of a horse), he suddenly heard a 
scream coming from a clearing ahead. He spurred on 
his horse, and arrived just in time to see a great drag- 
on (a species of Ptetrahedron, now extinct) bearing 
down on a beautiful young vector function, with long 
golden curls and zero div. Quickly drawing his
semi-infinite broadsword (they hadn’t invented
sixth-rooters in those days) Sir Chancealot rode to the
attack. When he reached the spot, the dragon had
retreated 8 feet. Once more he approached. The dragon
retreated 4 feet. Again he tried. The dragon retreated
2 feet. “You can’t fool me with Zeno’s paradox!” cried
the knight, and quickly
summing the geometric series, he advanced 4 more feet 
and struck! 

The dragon crumpled to the ground, pierced through 
the origin. Sir Chancealot then rode over to where the 
damsel lay on the ground in a dead faint. He picked 
her up, and set out for the nearest invariant subspace,
where he was received with great joy, for the damsel had 
been feared lost. 

The villagers inquired the purpose of his travels. When
they heard that he was searching for an Eliminant, they
suggested he try the gnomes - they might know. So Sir
Chancealot set out for Gnomeland without delay.

At the gateway, over which hung a sign saying “Gnome 
sweet Gnome,” he was accosted by a raggedly-dressed 
gnome who wanted him to buy some eels. “Why on 
Earth should I?” he asked. “Because of the Buy Gnome- 
eel theorem!” 

He rode past the gate, and over a bridge with parabolic 
arches. Through the foci of these were the remains of 
cords, frayed and broken. He asked how this had hap- 
pened. It appeared that one gnome called Quintus Latus 
(meaning Take Five), occasionally went on a pub crawl, 
and, like castor oil, was nasty when drunk. “Cords
through the foci? Why, Latus wrecked ’em!”

This, of course, was no help in his quest, but luckily a 
wiser gnome than the others suggested he try asking 
an ogre, who lived in the mountains, and was reputed- 
ly very clever. As he rode along, he heard the ogre ap- 
proach, singing “Ogre the hills and far away.” Now the 
ogre was a nice ogre really, and only devoured damsels 
in distress because oxen were out of fashion, so when he 
heard what Sir Chancealot wanted, he was as helpful as 
he could be. Unfortunately, the only Eliminant he knew 
of was degenerate. And this was no use. 

Discouraged, Sir Chancealot decided to give up his
quest, retire from knighthood, and take up chick- 
en-farming. So he went to Baron Humpdinger for ad- 
vice, saying “I come not as a knight, I come for hens!” 

When King Arthur found out what a rotten cowardly
knight he was, he changed his name to “Sir
Come-for-hens,” which, in the course of time, became
“Circumference”.

He became so infatuated with his hens that he began to
address them as if they were human (he thought they
were). “Hens,” he would say, “we are going to square
some roots today,” or “Hens, we have solved our
problem.” This manner of speaking was rapidly adopted
by the mathematicians of the time, who, being only
ignorant mathematicians, couldn’t spell, and wrote it
“hence.”

First printed in Eureka 2, 1939

Oh My Gosh, It’s Covered in Rule 30s!

Stephen Wolfram 


A week ago [21 May 2017] a new train station, 
named “Cambridge North”, opened in Cam- 
bridge, UK. Normally such an event would be 
far outside my sphere of awareness. (I think I last took a 
train to Cambridge in 1975.) But last week people
started sending me pictures of the new train station,
wondering if I could identify the pattern on it

And, yes, it does indeed look a lot like patterns I’ve
spent years studying - that come from simple programs
in the computational universe. My first - and still
favorite - examples of simple programs are
one-dimensional cellular automata like this

The system evolves line by line from the top,
determining the color of each cell according to the rule
underneath. This particular cellular automaton I called
“rule 182”, because the bit pattern in the rule
corresponds to the number 182 in binary. There are
altogether 256 possible cellular automata like this, and
this is what all of them do



Many of them show fairly simple behavior. But the huge
surprise I got when I first ran all these cellular
automata in the early 1980s is that even though all the
rules are very simple to state, some of them generate
very complex behavior. The first in the list that does
that - and still my favorite example - is rule 30

If one runs it for 400 steps one gets this



And, yes, it’s remarkable that starting from one black
cell at the top, and just repeatedly following a simple
rule, it’s possible to get all this complexity. I think
it’s actually an example of a hugely important
phenomenon, that’s central to how complexity gets made in
nature, as well as to how we can get a new level of
technology. And in fact, I think it’s important enough
that I spent more than a decade writing a 1200-page book
(that just celebrated its 15th anniversary) based on it.
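For readers without the Wolfram Language to hand, the
rule-numbering scheme described above can be sketched in Python:
each three-cell neighborhood, read as a binary number from 0 to
7, picks out one bit of the rule number. The function names are
invented for this sketch, and it assumes a white (0) background,
so it is only suitable for rules such as rule 30 that turn three
white cells into a white cell.

```python
def step(cells, rule):
    """Apply an elementary cellular automaton rule once (cells outside are 0)."""
    padded = [0, 0] + cells + [0, 0]
    new_cells = []
    for i in range(1, len(padded) - 1):
        left, center, right = padded[i - 1], padded[i], padded[i + 1]
        # The neighborhood, read as a 3-bit number, selects one bit of the rule.
        index = 4 * left + 2 * center + right
        new_cells.append((rule >> index) & 1)
    return new_cells

def run(rule, steps):
    """Rows grown from a single black cell; each row is 2 cells wider."""
    rows = [[1]]
    for _ in range(steps):
        rows.append(step(rows[-1], rule))
    return rows

for row in run(30, 8):
    print("".join("#" if cell else "." for cell in row).center(17))
```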

And for years I’ve actually had rule 30 on my business 
cards 



But back to the Cambridge North train station. Its pat- 
tern is obviously not completely random. But if it was 
made by a rule, what kind of rule? Could it be a cellular 
automaton? 


I zoomed in on a photograph of the pattern 



Suddenly, something seemed awfully familiar: the tri- 
angles, the stripes, the L shapes. Wait a minute... it 
couldn’t actually be my favorite rule of all time, rule 30?

Clearly the pattern is tipped 45° from how I’d usually
display a cellular automaton. And there are black
triangles in the photograph, not white ones like in
rule 30. But if one black-white inverts the rule (so it’s
now rule 135), one gets this



And, yes, it’s the same kind of pattern as in the
photograph! But if it’s rule 30 (or rule 135) what’s its
initial condition? Rule 30 can actually be used as a
cryptosystem - because it can be hard (maybe even
NP-complete) to reconstruct its initial condition.

But, OK, if it’s my favorite rule, I wondered if maybe
it’s also my favorite initial condition - a single black
cell.
And, yes, it is! The train station pattern comes exactly 
from the (inverted) right-hand edge of my favorite rule 
30 pattern! 



Here’s the Wolfram Language code. First run the cellular
automaton, then rotate the pattern

Rotate[ArrayPlot[CellularAutomaton[135, {{1}, 0}, 40],
  Mesh -> True], -45 °]

It’s a little trickier to pull out precisely the section
of the pattern that’s used. Here’s the code (the
PlotRange is what determines the part of the pattern
that’s shown)

Graphics[Rotate[First[ArrayPlot[CellularAutomaton[135,
  {{1}, 0}, 80], Mesh -> True]], -45 °],
  PlotRange -> {{83, 104}, {-12, 60}}]




OK, so where is this pattern actually used at the train
station? Everywhere!



It’s made of perforated aluminum. You can actually look
through it, reminiscent of an old latticed window. From
inside, the pattern is left-right reversed - so if it’s
rule 135 from outside, it’s rule 149 from inside. And at
night, the pattern is black-white inverted, because
there’s light coming from inside - so from the outside
it’s “rule 135 by day, and rule 30 at night”.
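The rule numbers quoted here can be checked by treating a rule
as a lookup table over the eight possible neighborhoods.
Swapping black and white in both the inputs and the output gives
the color-inverted rule, and swapping the left and right
neighbors gives the mirror image. The helper names below are
made up for this Python sketch.

```python
def rule_table(rule):
    """Map each neighborhood (left, center, right) to the rule's output bit."""
    return {
        (p, q, r): (rule >> (4 * p + 2 * q + r)) & 1
        for p in (0, 1) for q in (0, 1) for r in (0, 1)
    }

def rule_number(table):
    """Inverse of rule_table: rebuild the rule number from its table."""
    return sum(bit << (4 * p + 2 * q + r) for (p, q, r), bit in table.items())

def invert_colors(rule):
    """Rule obtained by swapping black and white everywhere."""
    table = rule_table(rule)
    return rule_number(
        {(p, q, r): 1 - table[(1 - p, 1 - q, 1 - r)] for (p, q, r) in table}
    )

def mirror(rule):
    """Rule obtained by reflecting the pattern left to right."""
    table = rule_table(rule)
    return rule_number({(p, q, r): table[(r, q, p)] for (p, q, r) in table})

print(invert_colors(30), mirror(135))  # 135 149
```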


What are some facts about the rule 30 pattern? It’s
extremely hard to rigorously prove things about it (and
that’s interesting in itself - and closely related to the
fundamental phenomenon of computational irreducibility).
But, for example - like, say, the digits of π - many
aspects of it seem random. And, for instance, black and
white squares appear to occur with equal frequency -
meaning that at the train station the panels let in
about 50% of the outside light.


If one looks at sequences of n cells, it seems that all
2^n configurations will occur on average with equal
frequency. But not everything is random. And so, for
example, if one looks at 3×2 blocks of cells, only 24 of
the 32 possible ones ever occur. (Maybe some people
waiting for trains will figure out which blocks are
missing...)


When we look at the pattern, our visual system particu- 
larly picks out the black triangles. And, yes, it seems as 
if triangles of any size can ultimately occur, albeit with 
frequency decreasing exponentially with size. 


If one looks carefully at the right-hand edge of the rule 30 
pattern, one can see that it repeats. However, the repeti- 
tion period seems to increase exponentially as one goes 
in from the edge. 

At the train station, there are lots of identical panels. But 
rule 30 is actually an inexhaustible source of new pat- 
terns. So what would happen if one just continued the 
evolution, and rendered it on successive panels? Here’s
the result. It’s a pity about the hint of periodicity on the
right-hand edge, and the big triangle on panel 5 (which 
might be a safety problem at the train station). 



Fifteen more steps in from the edge, there’s no hint of
that anymore 



What about other initial conditions? If the initial condi- 
tions repeat, then so will the pattern. But otherwise, so far 
as one can tell, the pattern will look essentially the same 
as with a single-cell initial condition. One can try other 
rules too. Here are a few from the same simplest 256-rule 
set as rule 30 



[Panels: rule 30, rule 182, rule 193]


Moving deeper from the edge the results look a little
different (for aficionados, rule 89 is a transformed
version of rule 45, rule 182 of rule 90, and rule 193 of
rule 110)

[Panels: rule 30, rule 73, rule 89, rule 105, rule 135,
rule 182, rule 193]

And starting from random initial conditions, rather
than a single black cell, things again look different:

[Panels: rule 30, rule 73, rule 89, rule 105, rule 135,
rule 182, rule 193]

And here are a few more rules, started from random
initial conditions:

[Panels: rule 41, rule 60, rule 105, rule 106]

Here’s a website (made in a couple of minutes with a
tiny piece of Wolfram Language code) that lets you
experiment (including with larger rule numbers, based on
longer-range rules). (And if you want to explore more
systematically, here’s a Wolfram Notebook to try.)

[Screenshot: Cellular Automaton Panel Explorer]

It’s amazing what’s out there in the computational
universe of possible programs. There’s an infinite range
of possible patterns. But it’s cool that the Cambridge
North train station uses my all-time favorite discovery
in the computational universe - rule 30! And it looks
great!


The Bigger Picture 


There’s something curiously timeless about algorithmically
generated forms. A dodecahedron from ancient
Egypt still looks crisp and modern today. As do periodic 
tilings - or nested forms - even from centuries ago 



But can one generate richer forms algorithmically?
Before I discovered rule 30, I’d always assumed that any
form generated from simple rules would always somehow
end up being obviously simple. But rule 30 was a big
shock to my intuition - and from it I realized that
actually in the computational universe of all possible
rules, it’s actually very easy to get rich and complex
behavior, even from simple underlying rules.

And what’s more, the patterns that are generated often
have remarkable visual interest. Here are a few
produced by cellular automata (now with 3 possible
colors for each cell, rather than 2)

There’s an amazing diversity of forms. And, yes, they’re
often complicated. But because they’re based on simple
underlying rules, they always have a certain logic to
them: in a sense each of them tells a definite
“algorithmic story”.

One thing that’s notable about forms we see in the
computational universe is that they often look a lot like
forms we see in nature. And I don’t think that’s a
coincidence. Instead, I think what’s going on is that
rules in the computational universe capture the essence
of laws that govern lots of systems in nature - whether
in physics, biology or wherever. And maybe there’s a
certain familiarity or comfort associated with forms in
the computational universe that comes from their
similarity to forms we’re used to in nature.

But is what we get from the computational universe art? 
When we pick out something like rule 30 for a
particular purpose, what we’re doing is conceptually a
bit like photography: we’re not creating the underlying
forms,
but we are selecting the ones we choose to use. 

In the computational universe, though, we can be more 
systematic. Given some aesthetic criterion, we can au- 
tomatically search through perhaps even millions or 
billions of possible rules to find optimal ones: in a sense 
automatically “discovering art” in the computational 
universe. 

We did an experiment on this for music back in 2007:
WolframTones. And what’s remarkable is that even by
sampling fairly small numbers of rules (cellular
automata, as it happens), we’re able to produce all
sorts of interesting short pieces of music - that often
seem remarkably “creative” and “inventive”.

From a practical point of view, automatic discovery 
in the computational universe is important because it 
allows for mass customization. It makes it easy to be 
“original” (and “creative”) - and to find something dif- 
ferent every time, or to fit constraints that have never 
been seen before (say, a pattern in a complicated ge- 
ometric region). 

The Cambridge North train station uses a particular 
rule from the computational universe to make what 
amounts to an ornamental pattern. But one can also use 
rules from the computational universe for other things 
in architecture. And one can even imagine a building in 
which everything - from overall massing down to de- 
tails of moldings - is completely determined by some- 
thing close to a single rule. 

One might assume that such a building would some- 
how be minimalist and sterile. But the remarkable fact is 
that this doesn’t have to be true - and that instead there
are plenty of rich, almost “organic” forms to be “mined” 
from the computational universe. 


Ever since I started writing about one-dimensional
cellular automata back in the early 1980s, there’s been
all sorts of interesting art done with them. Lots of
different rules have been used. Sometimes they’ve been
what I called “class 4” rules that have a particularly
organic look. But often it’s been other rules - and
rule 30 has certainly made its share of appearances -
whether it’s on floors, shirts, tea cosies, kinetic
installations, or, recently, mass-customized scarves
(with the knitting machine actually running the cellular
automaton)



But today we’re celebrating a new and different
manifestation of rule 30. Formed from permanent
aluminum panels, in an ancient university town, a
marvellous corner of the computational universe adorns
one of the most practical of structures: a small train
station. My compliments to the architects. May what
they’ve made give generations of rail travelers a little
glimpse of the wonders of the computational universe.
And maybe perhaps a few, echoing the last words
attributed to the traveler in the movie 2001: A Space
Odyssey, will exclaim “oh my gosh, it’s covered in
rule 30s!”


Reprinted from blog.stephenwolfram.com 
with kind permission of Stephen Wolfram

Icosahedral Honeycomb

The Poincaré ball model view of a hyperbolic
regular icosahedral honeycomb. The icosahedral
honeycomb is one of four compact regular space-filling
tessellations in hyperbolic 3-space.

Seven Million Reasons to Love Maths

Tom Crawford 


I have always liked Maths; my first memory of the
subject is from the tender age of seven. We were
doing addition questions and had been told by the
teacher to work our way through a set of exercises in
a textbook. I remember racing through the questions
and finishing the work we had been set in no time, but
unsatisfied I carried on, and on, and on, until suddenly
there were no more questions left and I realised I had
finished the book...

Fast forward to my teenage years and I was getting
excited by maths once again, this time reading about the
Millennium Maths Problems. It’s quite simple really: tell
someone (especially a teenager) that they can be a
millionaire by doing a particular thing and suddenly
that thing seems a lot more attractive. This is exactly
what the Millennium Maths Problems are: seven unsolved
problems that each have a prize of $1 million for a
correct solution. If this sounds great, then you’re right
- it is - but a little word of caution: these are not
your standard everyday Maths problems. One of them (The
Yang-Mills Mass Gap Hypothesis to give it its full
title) basically asks the question ‘why do things have
mass?’ Another, the Hodge Conjecture, is so complicated
that depending on which mathematician you ask you will
get a different version of the problem. How can we hope
to solve a problem when we can’t even agree on what it
is in the first place? It’s almost like mathematicians
purposefully make things as difficult as possible to
maintain a sense of superiority. Well, not me.

This is maths, but not as you know it. That means it’s
fun. That means it’s entertaining. And most importantly
that means it’s understandable. If after my
explanations, you can’t explain the subject to your
grandma then I’ve failed. You can even shout at me via
various social media platforms and let me know. I’m here
to rock maths and the Millennium Problems are the
perfect starting point. They are the first bit of maths
that really got me excited about the subject and I hope
they will have the same effect on you. Let’s begin with
a little history lesson...

The problems were first presented at a meeting in Paris 
in the year 2000 (as the name would suggest) as maths 
looked to jump on the “oh look it’s a new millennium 
let’s do something exciting” bandwagon. This wasn’t 
the first time important unsolved maths problems had 
been discussed in the French capital. Go back to the 
year 1900 and the rather excitingly named ‘International 
Congress of Mathematicians’ where a man named 
David Hilbert presented a list of 23 problems that he 
believed to be the greatest unsolved problems in maths 
at the time. You can think of the Millennium Problems 
as sort of a flashier, streamlined, more up-to-date version 
of the idea, pretty much like the new Beauty and 
the Beast movie that has just come out. In the new film 
you swap a cartoon princess for Emma Watson and with 
the Millennium Problems you swap 23 problems with 
no prize money for 7 questions each worth a million 
dollars. I know which one I prefer... 

The funding for the prizes comes from the Clay Insti- 
tute who worked together with leading mathematicians 
to pick the final seven. I’m not sure exactly how you 
pick the ‘greatest unsolved problems currently faced 
by mathematics’, but I like to think that they locked 
everyone in a dark room and held some kind of secret 
candlelit ballot... it certainly adds drama to the election 
of a new pope - maybe they even released some white 


About the Author 

Tom is a tutor in mathematics at St Hugh’s College at the University of Oxford. He previously worked for the Naked 
Scientists - an award-winning production company that specialises in broadcasting science news internationally via the 
radio and podcasts, and can frequently be heard talking about all things science and maths on BBC Radio. He completed 
his undergraduate degree in maths at the University of Oxford and recently obtained his PhD from the University of 
Cambridge in experimental fluid mechanics. All of Tom’s maths outreach material can be found on his website 
tomrocksmaths.com and you can follow him on Twitter, Facebook and Instagram @tomrocksmaths. This article consists of 
the first three parts of an eight-part series Tom has written about the Millennium Maths Problems. 
smoke after each problem was picked. But I’m getting 
off topic. The actual criterion for a problem to be selected 
was that it must be a ‘classical problem that had resisted 
solution for many years’. I’m not sure that really narrows 
it down, but I suppose maths doesn’t exactly have a 
reputation for doing things the easy way... until now. 

Over the next few weeks I plan to delve deeper into each 
of the seven problems in turn: telling you what it is, why 
you should care about it and what it would mean if we 
were somehow able to solve it (besides becoming mil- 
lionaires and diving into a pool of gold Scrooge McDuck 
style). We’ll have fun - I promise - and who knows you 
may even learn something along the way. As the great 
Lady Gaga once said “give me a million reasons...” Lady, 
I plan to give you seven million. 

The Riemann Hypothesis 

The Riemann Hypothesis is a great place to begin our 
adventure exploring the untamed jungle of the Millennium 
Problems, as it’s the only one of Hilbert’s original 
23 problems from 1900 to make it onto the Millennium list (if you 
have no idea what I’m on about you had better go back 
and read my first article here - it’s great, trust me). 

The Riemann Hypothesis is both an oldie and a goodie. 
It was first formulated in 1859 by Bernhard Riemann 
and is still puzzling mathematicians over 150 years later. 
The hypothesis looks at prime numbers - these are the 
ones that can only be made by multiplying themselves 
by one and have no other factors. Let’s take the number 
42, also known as the answer to the universe and 
everything in it. 42 has lots of factors: 7 x 6, 14 x 3, 21 x 2, 
42 x 1. Okay, it has eight factors, but importantly it has more 
than two. Now what about the number next to it, 41? 
The only way to get this is by multiplying 41 times 1 - 
that’s it. This makes 41 a prime number. 

Prime numbers are very special in maths because all 
numbers can be made from them. If I give you any 
number, you can break it down into smaller and smaller 
pieces with each one being a prime number. Back to our 
example of 42. 7 x 6 is the same as 7 x (3 x 2). 21 x 2 is 
the same as (7 x 3) x 2. 42 x 1 is also 7 x 3 x 2. You’ll 
notice there’s a pattern here: the building blocks to make 
42 are 7, 3 and 2 which are all prime numbers. This will 
be true for any number you can think of - give it a go: 
write down the factors, and then keep breaking down 
each factor into smaller and smaller factors and eventually 
you get down to prime numbers. If DNA gives the 
building blocks of life, prime numbers give the building 
blocks of maths. While we now understand DNA pretty 
well - we can edit it and cause all kinds of cool things to 
happen - the same cannot be said for prime numbers. We 
know what they are and that all numbers can be written 
in terms of them, but that’s pretty much the whole story. 
We really don’t understand them very well, which 
brings us very nicely back to our Millennium Problem. 
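
If you fancy trying the "keep breaking it down" game on a computer, here is a minimal sketch in Python. The function name prime_factors is just my own label for illustration, and the method (repeatedly dividing by the smallest number that fits) is the simplest one going, not a clever one.

```python
def prime_factors(n: int) -> list[int]:
    """Break n (an integer >= 2) into its prime building blocks."""
    factors = []
    divisor = 2
    # Any composite number has a factor no larger than its square root.
    while divisor * divisor <= n:
        while n % divisor == 0:
            factors.append(divisor)
            n //= divisor
        divisor += 1
    # Whatever is left over (if bigger than 1) is itself prime.
    if n > 1:
        factors.append(n)
    return factors


print(prime_factors(42))  # [2, 3, 7]
print(prime_factors(41))  # [41]
```

However you start splitting 42 up, you always land on the same three primes; 41 refuses to split at all.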

The Riemann Hypothesis talks about a very complicated 
function called the Riemann Zeta Function. I’ll 
show you what it looks like because it’s cool - and if you 
want to impress your friends, just say “I know what the 
Riemann zeta function looks like” and write the formula 
down. Disclaimer: the success of this tactic may or 
may not depend on your friends’ opinion of cool-looking 
maths. I’d be impressed at least. Anyway, here it is: 

$$\zeta(s) = \sum_{n=1}^{\infty} \frac{1}{n^s}$$

Beyond how cool it looks written down, we don’t really 
need to know any more about it. The Riemann Hypothesis 
basically says here are some properties about 
this Riemann function: are they true or not? If they are, 
and most mathematicians think they will be, then it 
will mean that we understand the location of the prime 
numbers much better and most importantly we know 
where to look to find them. 

So why do we even care where prime numbers are on 
the number line? This is the million-dollar question, 
quite literally. When you use your credit card online, 
the details are encrypted with a code made up from a 
number with 200 or so digits to protect you from hack- 
ers. Even if they intercept the signal, it just looks like 
nonsense. The only way to decrypt the code is to know 
which two prime numbers multiply together to make 
the 200-digit number. Breaking a number down into 
prime number factors like we did above for 42 isn’t too 
tricky, because it’s a small number. Once the numbers 
get larger, the game changes. Even the most powerful 
computers can only work out the prime factors of numbers 
up to 100 digits in length - so with 200 digits your 
credit card details are ‘as safe as houses’. 

Current methods of online security work precisely because 
we don’t understand the prime numbers. If you 
were to solve the Riemann Hypothesis you wouldn’t just 
get the million-dollar prize, you’d also be able to steal 
the credit card details of everyone on the internet... I 
am absolutely not saying you should do this, just you 
know, you probably could if you were that way inclined. 

P versus NP 

Number two on the list is another big-dog in the world 
of maths. It’s so famous it’s even appeared in several 
Hollywood movies as directors try to add more credibility 
to their stories involving a robot-apocalypse - yes, 
I’m looking at you iRobot. The problem is called P versus 
NP. 

It’s a relatively new addition to the world of maths in 
its current form, but the problem it’s trying to solve can 
be traced back to prehistoric times. Picture the scene: 
you wake up in your cave and you’re hungry, you’re 
thirsty and you could probably do with a wash as your 
attractive neighbour in the cave next door is coming 
over later for some quality ‘netflix and chill’ time. That’s 
a good point actually, you also need some flowers and 
the latest blockbuster cave painting. That makes five 
things, meaning five different places you need to visit. 
The question is: what’s the quickest route that you can 
take that visits all five locations and ends with you back 
at home? Today we just use Google, but as prehistoric 
you is not so lucky, let’s try some maths. 

There are 5 places to visit, which means you have a choice 
out of 5 for the first place to go to. Once you’ve visited 
stop one, you then have a choice of 4 places to visit next. 
Then a choice of 3, then 2 and finally no choice for the 
last one as it’s the only one left. So in total that’s 5 x 4 x 
3 x 2 x 1 = 120 different routes. We can probably cross 
a few routes off the list using our knowledge of the local 
area - maybe half of them pass through the swamplands 
home to Mr T. Rex, not only slowing you down severely, 
but also giving a high probability of death. 
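
Here is a rough sketch of that "try every route" approach in Python. Everything about the map is made up for illustration: the place names, the coordinates, and the assumption that you walk in straight lines between stops.

```python
from itertools import permutations
from math import dist

# Hypothetical map: (x, y) positions of home and the five stops.
places = {
    "home": (0, 0),
    "food": (2, 1),
    "water": (1, 3),
    "wash": (4, 2),
    "flowers": (3, 5),
    "cave painting": (5, 0),
}
stops = [name for name in places if name != "home"]


def route_length(route):
    """Total straight-line distance: home -> each stop in order -> home."""
    path = ["home", *route, "home"]
    return sum(dist(places[a], places[b]) for a, b in zip(path, path[1:]))


# Every possible visiting order: 5 x 4 x 3 x 2 x 1 of them.
routes = list(permutations(stops))
print(len(routes))  # 120

best = min(routes, key=route_length)
print(best, round(route_length(best), 2))
```

With five stops this finishes instantly. The trouble, as we are about to see, is what happens to the size of that list when the number of stops grows.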

When you ask Google to work out the quickest route 
it will actually check all 120 possibilities and then give 
you the fastest. Not really a problem today, computers 
are quick and so you can probably get the quickest route 
in less than a second. But what if you want to visit 1000 
different places - maybe you’re very popular in your 
prehistoric village and have many partners to visit - that 
means 1000 x 999 x 998 x ... x 2 x 1 different routes to 
check. I’m not going to write down what that number is, 
but I can tell you that for a computer today to check that 
many different routes it would take longer than the age 
of the universe. 


This is pretty much the P versus NP problem in a nutshell: 
if we can program a computer to recognise a solution 
to a problem quickly, can we also program it to find 
that solution quickly? With our example, your computer 
can tell you if your suggested route will visit all 1000 
stops in a matter of milliseconds, but hell will quite literally 
freeze over before it can tell you if it’s the fastest of 
all possible routes. P versus NP is asking whether or not 
there is a shortcut that doesn’t involve checking every 
single possible combination. 
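
To get a feel for the gap between checking and finding, here is a small Python sketch. The function name visits_every_stop and the numbered stops are my own inventions for illustration; the point is only that the check needs one pass through the route, while the number of routes to search has thousands of digits.

```python
from math import factorial


def visits_every_stop(route, stops):
    """Quick check: does the route visit each stop exactly once?"""
    return len(route) == len(stops) and set(route) == set(stops)


stops = list(range(1000))          # 1000 hypothetical stops, numbered 0 to 999
suggested_route = stops[::-1]      # any ordering will do for the check

print(visits_every_stop(suggested_route, stops))  # True

# The number of different routes is 1000 x 999 x ... x 2 x 1.
number_of_routes = factorial(1000)
print(len(str(number_of_routes)))  # 2568 digits
```

For comparison, the age of the universe measured in seconds is a number with only 18 digits.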

Most mathematicians believe that P does not equal NP 
and in fact if the two are shown to be equal the world 
as we know it will probably end. P=NP means we can 
instantly break any code that has ever existed, predict 
stock market trends before they happen and most wor- 
ryingly build robots with artificial intelligence higher 
than anyone on earth. Better get Will Smith on speed 
dial just in case iRobot wasn’t too far wrong after all... 





Belousov-Zhabotinsky Reaction 

The Belousov-Zhabotinsky reaction is one of 
a class of reactions that serves as a classical 
example of non-equilibrium thermodynam- 
ics, resulting in the establishment of a non- 
linear chemical oscillator. Here, a computer 
simulation of the Belousov-Zhabotinsky re- 
action occurring in a Petri dish is shown. 
Comic Corner* 

*we make no apologies for the lack of quality of some of these jokes 



The best (worst) of the puns... 

Q: Why didn't Newton discover group theory? 
A: Because he wasn't Abel. 

Q: What does the little mermaid wear? 

A: An Algebra 

Q: What is a dilemma? 

A: A lemma that proves two results. 


The five best excuses for why you didn't 
do that last example sheet... 

1. I have a solar powered calculator and it was 
cloudy. 

2. Isaac Newton's birthday. 

3. I could only get arbitrarily close to my text book. I 
couldn't actually reach it. 

4. I have the proof, but there isn't room to write it in 
this margin. 

5. I could have sworn I put the homework inside a 
Klein bottle, but this morning I couldn't find it. 


[Comic strip: a school telephones a mother about computer trouble caused by her son, who is named Robert'); DROP TABLE Students;-- ("Little Bobby Tables"). The school has lost this year's student records; the mother hopes it has learned to sanitize its database inputs.] 



[Comic strip: "I wish I could do math like when I was young. It doesn't come easy like it once did. Math is a game for the young. I need to sit back and let the future happen." "You're thirteen."] 

[Comic: "Never have I felt so close to another soul, and yet so helplessly alone, as when I Google an error and there's one result: a thread by someone with the same problem and no answer, last posted to in 2003."] 


Ode to the Mobius strip... 

A mathematician confided 
That the Mobius band is one-sided 
And you'll get quite a laugh 
If you cut one in half 
'Cause it stays in one piece when divided. 

A mathematician named Klein 
Thought the Mobius band was divine 
Said he: If you glue 
The edges of two 
You'll get a weird bottle like mine. 


One from the mathematician, physicist and engineer collection... 

A mathematician, a physicist, and an engineer went again to the races and laid their money down. Commiserating 
in the bar after the race, the engineer said, "I don't understand why I lost all my money. I measured all the horses 
and calculated their strength and mechanical advantage and figured out how fast they could run..." 

The physicist interrupted him: "...but you didn't take individual variations into account. I did a statistical analysis of 
their previous performances and bet on the horses with the highest probability of winning..." 

"...so if you're so hot why are you broke?" asked the engineer. But before the argument could grow, the mathema- 
tician took out his pipe and they got a glimpse of his well-fattened wallet. Obviously here was a man who knows 
something about horses. They both demanded to know his secret. 

"Well," he said, "first I assumed all the horses were identical and spherical..." 

Can You Solve My Problems? by Alex Bellos 

From the author of Alex's Adventures in Numberland and Alex Through the Looking Glass, 
comes Can You Solve My Problems?, a book that challenges you to pit your wits against 125 
carefully chosen puzzles. Put simply, Alex Bellos has again served up a great book for mathe- 
matical novices and experts alike. Successfully bringing together puzzles of a range of types, 
from logical to geometrical, and of a range of difficulties, from easy-peasy to nigh-on impossi- 
ble, there's certainly something here for everyone. What really sets this book apart from other 
puzzle collections though is the focus on the origins and historical development of each of 
the puzzles, as well as the exceedingly detailed solutions. A great gift for anyone that enjoys 
puzzles of any kind. 



Reality is Not What it Seems by Carlo Rovelli 

Carlo Rovelli, author of the bestselling Seven Brief Lessons on Physics, returns with this great 
read on the nature of the universe. As in his previous book, Rovelli writes with a clarity and el- 
egance that makes the book a joy to read. By no means a long book (physics aficionados may 
find it a little bit on the short side), he still manages to pack in much detail; taking the reader on 
a journey through mankind's evolution of the understanding of the universe. Indeed, with little 
math, and a healthy dose of drawings, Rovelli is impressively able to make such a complicated 
subject tractable: from Democritus to cutting edge quantum gravity research, 2500 years of 
physics has never been so accessible to the lay person. Whether you're in to science, history, 
or philosophy, you will find something of interest! 


The Indisputable Existence of Santa Claus 

by Hannah Fry and Thomas Oléron Evans 

Have you ever wondered how to cook the perfect spherical turkey? Or how to wrap gifts using 
their surface area to volume ratio? Well there's answers to these questions and many more 
you probably never knew you even had in this joke-packed festive book. Traversing their way 
through a typical festive season, in short, the book serves as a guide to the perfect mathemat- 
ical Christmas. The authors really should be commended for so impressively demonstrating 
how mathematics can infiltrate almost everything. With some geometry, Markov chains, and 
game theory, the intricacies of the maths may be too much for some, but generally it's a highly 
accessible humorous read. A short book, at only 135 pages, there's few better stocking fillers 
out there for the maths lover in your life! 




Professor Stewart's Incredible Numbers 

by Ian Stewart 

Incredible Numbers is the latest in a long line of offerings from the grand master of popular 
mathematics writing, Ian Stewart. In it, Professor Stewart explores the astounding and surprising 
properties of a whole host of numbers; from as simple as 2 to as bizarre as 1.059463. Reals, 
rationals, irrationals, and imaginary numbers are all given focus. However, what is perhaps 
slightly confusing given its title, is that this is not a book about numbers at all. The numbers 
are simply a way of introducing a host of intriguing mathematical techniques and problems, 
and how they arise in an array of fields from art to physics. As with all of the author's books, it 
is a truly accessible, interesting and entertaining read for all ranges of mathematical abilities. 



The Lapidary Numbers 

Or the Combinatorics of Communication by Throwing Stones 

Peter Taylor 

Alumnus of Trinity College, 1999 


Two spies (Alice and Bob) need to exchange messages. 
Each will encode their message as a number from 1 to m 
before they meet. They will then meet by the river and 
communicate secretly by throwing stones. There will 
be a pile of n indistinguishable stones at their meet- 
ing point. Starting with Alice, they take turns throwing 
some of the stones into the river. Each spy must throw at 
least one stone on their turn, until all n stones are gone. 
They observe all throws and separate when there are no 
more stones. No information is exchanged except the 
number of stones thrown on each turn. 

Given that n is known in advance, what is the largest 
possible value of m? 

Let us call this value the nth lapidary number, from the 
Latin lapis, lapidis meaning stone, and denote it $L(n)$. 
The earliest form of this question of which the author is 
aware is a February 2011 question on a Russian puzzle 
site asking whether $L(26) > 1700$ [1]. 

Analysis: decision trees 

When there are k stones remaining, the current player 
can throw 1 to k stones, so the full decision tree for 
the exchange can be defined inductively. If $F_k$ is the 
full decision tree when k stones remain, we have that 
$F_{k+1}$ is a node with k + 1 children, made up of one 
instance of $F_i$ for $0 \le i \le k$. ($F_0$ is a leaf, because 
there are no stones left to throw.) 

However, if Alice has the option of throwing all k 
stones then Bob might not get a chance to throw any, 
and hence cannot be certain of communicating any 
more than 1 distinct message. To allow both players to 
communicate as many messages as possible, they must 
agree to prune the full tree. Let $T_k$ be the set of permissible 
decision trees with k stones remaining, and $\varepsilon$ be a 
placeholder to replace a subtree which has been pruned 
away. Then we have $T_0 = \{F_0\}$ and 

$$T_{k+1} = \left( \prod_{i=0}^{k} \left( T_i \cup \{\varepsilon\} \right) \right) \setminus \prod_{i=0}^{k} \{\varepsilon\},$$

since we can't prune all the subtrees of a node away, as 
that would leave some stones unthrown. 


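There is a computationally useful bijection (or identity, 
if we formalise tuples as nested ordered pairs) which expresses 
$T_{k+1}$ solely in terms of $T_k$: 

$$\begin{aligned}
T_{k+1} &= \left( (T_k \cup \{\varepsilon\}) \times \prod_{i=0}^{k-1} (T_i \cup \{\varepsilon\}) \right) \setminus \prod_{i=0}^{k} \{\varepsilon\} \\
&= \left( T_k \times \Big( T_k \cup \prod_{i=0}^{k-1} \{\varepsilon\} \Big) \right) \cup \left( \{\varepsilon\} \times \Big( T_k \cup \prod_{i=0}^{k-1} \{\varepsilon\} \Big) \right) \setminus \prod_{i=0}^{k} \{\varepsilon\} \\
&= (T_k \times T_k) \cup \Big( T_k \times \prod_{i=0}^{k-1} \{\varepsilon\} \Big) \cup (\{\varepsilon\} \times T_k).
\end{aligned}$$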

We can assign a value to each element $t_k \in T_k$ of 
$(x, y)$ where $\pi_1(x, y) = x$ is the number of messages 
which the first player can communicate, and 
$\pi_2(x, y) = y$ is the number which the second player 
can communicate. The value of $F_0$ is $(1, 1)$. For a tree 
with k stones remaining, $t_k = (t_0, t_1, \ldots, t_{k-1})$, 
the first player will throw one or more stones to choose 
one of the children, and will be the second player at that 
child; so $\pi_1 t_k = \sum_{i=0}^{k-1} \pi_2 t_i$. The second player has 
no influence over which of the children will be chosen, 
so $\pi_2 t_k = \min_{i=0}^{k-1} \pi_1 t_i$. This is consistent with the 
use of $\varepsilon$ for pruned subtrees if we assign to $\varepsilon$ the value 
$(0, \infty)$. If we identify the elements of $T_k$ with their 
values then the bijection above gives 

$$T_{k+1} = \{ (x + y, \min(w, z)) \mid (w, x) \in T_k,\ (y, z) \in T_k \} \cup T_k \cup \{ (x, w) \mid (w, x) \in T_k \}. \tag{1}$$
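
The value recurrence can be iterated directly for small k. The following is a minimal sketch, assuming that the iteration starts from $T_1 = \{(1, 1)\}$ and that the largest message count available to both spies is the maximum of $\min(x, y)$ over the value pairs; the function names are illustrative and not taken from the article.

```python
def value_sets(k_max):
    """Return {k: set of (x, y) values of T_k} for 1 <= k <= k_max."""
    current = {(1, 1)}  # T_1: a single stone, no decisions to make
    result = {1: current}
    for k in range(1, k_max):
        # Keep the k-stone child (w, x) and at least one other child (y, z).
        nxt = {(x + y, min(w, z)) for (w, x) in current for (y, z) in current}
        # Prune the k-stone child: the remaining children form a tree in T_k.
        nxt |= current
        # Keep only the k-stone child: the two players swap roles.
        nxt |= {(x, w) for (w, x) in current}
        result[k + 1] = nxt
        current = nxt
    return result


def lapidary_from_values(values):
    """Largest m such that both spies can send one of m messages."""
    return max(min(x, y) for x, y in values)


sets = value_sets(8)
print(sorted(sets[3]))  # [(1, 1), (1, 2), (2, 1), (3, 1)]
print([lapidary_from_values(sets[k]) for k in range(1, 9)])
```

The sets grow quickly with k, so this direct approach is only practical for small numbers of stones.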

Iterated integer partition sum 
construction 

An integer partition $\lambda$ is a non-increasing sequence 
of positive integers $(\lambda_1, \lambda_2, \ldots, \lambda_r)$. These integers 
are called the parts of $\lambda$. It is sometimes convenient to 
use the frequency representation $\lambda = (1^{a_1} 2^{a_2} \ldots)$ 
where $a_n$ is the number of parts which are equal to n. 
It is also sometimes convenient to represent a partition 
as a Ferrers diagram: a pattern of dots in which the nth 
row has $\lambda_n$ dots. The conjugate partition $\lambda^*$ is obtained 
by transposing the Ferrers diagram of $\lambda$. 

Let us define the sum of two partitions 
$\lambda = (1^{a_1} 2^{a_2} \ldots)$ and $\mu = (1^{b_1} 2^{b_2} \ldots)$ as 
$\lambda + \mu = (1^{a_1+b_1} 2^{a_2+b_2} \ldots)$. This does not appear 
to be a construction which has been much studied¹, 
but it allows us to make a surprising connection 
between the pruned decision trees and integer partitions. 

Observe that if $(x, y) \in T_k$ and $x > 1$ then 
$(x - 1, y) \in T_k$, and similarly if $y > 1$ then 
$(x, y - 1) \in T_k$, since each spy can prune the cases 
where they would communicate their highest numbered 
message. Therefore if we mark the pairs $(x, y)$ 
on a Cartesian grid we obtain a Ferrers diagram, and 
hence a correspondence between $T_k$ and the integer 
partition whose nth part is $\max\{y \mid (n, y) \in T_k\}$. 
This gives a simpler method of constructing $T_k$. 

Lemma 1. Define a sequence of integer partitions 
$\lambda^{(k)}$ iteratively as $\lambda^{(1)} = (1)$ and 
$\lambda^{(k+1)} = \lambda^{(k)} + \lambda^{(k)*}$. Then $T_k$ corresponds to 
$\lambda^{(k)}$. 

Proof: By induction. The base case is simple: 
$T_1 = \{(1, 1)\}$ because there are no decisions to 
make, and $\lambda^{(1)} = (1)$ by construction. For the inductive 
step, the inductive hypothesis combined with 
Equation (1) make it necessary and sufficient to show 
that 

$$\lambda^{(k+1)}_n = \max\left( \left\{ \min\left( \lambda^{(k)}_a, \lambda^{(k)*}_{n-a} \right) \;\middle|\; 1 \le a < n \right\} \cup \left\{ \lambda^{(k)}_n, \lambda^{(k)*}_n \right\} \right).$$

But this follows almost directly from the sum construction. 
There exists $a$ such that the first n 
parts of $\lambda^{(k+1)}$ are the first $a$ parts of $\lambda^{(k)}$ and 
the first $n - a$ parts of $\lambda^{(k)*}$. If $a = 0$ then 
$\lambda^{(k+1)}_n = \lambda^{(k)*}_n$; if $a = n$ then $\lambda^{(k+1)}_n = \lambda^{(k)}_n$; 
and otherwise it is the least of $\lambda^{(k)}_a$ or $\lambda^{(k)*}_{n-a}$ because 
the parts are non-increasing. Moreover, precisely because 
the parts are non-increasing, it must be the 
greatest of the values that would be obtained by taking 
$a \in \{0, \ldots, n\}$. 

The sequence $\lambda^{(k)}$ was studied by Naohiro 
Nomoto in 2002 and he contributed A064660, the 
sequence of the number of distinct parts in $\lambda^{(k)}$, to 
the On-Line Encyclopedia of Integer Sequences [2]. 
Nomoto is unaware of any other publication on the 
sequence². The author wishes to express his gratitude, 
as A064660 was the clue which led him to Lemma 1. 

Remark It follows from the definition of the lapidary 
number $L(n)$ and the correspondence between the 
decision trees $T_n$ and the partition $\lambda^{(n)}$ that $L(n)$ is 
the size of the Durfee square of $\lambda^{(n)}$ (the largest square 
that fits inside its Ferrers diagram). By using the frequency 
representation, this provides the most efficient 
method known to the author for calculating lapidary 
numbers. $L(1)$ to $L(60)$ are 


n = 1 to 10: 1, 1, 1, 2, 2, 3, 4, 6, 8, 12 
n = 11 to 20: 16, 23, 31, 45, 61, 87, 119, 171, 233, 334 
n = 21 to 30: 459, 655, 904, 1288, 1782, 2535, 3517, 4995, 6935, 9848 
n = 31 to 40: 13703, 19437, 27070, 38376, 53528, 75842, 105878, 149966, 209555, 296707 
n = 41 to 50: 414922, 587304, 821853, 1163052, 1628574, 2304082, 3228091, 4566345, 6400884, 9052798 
n = 51 to 60: 12695506, 17953139, 25187813, 35614287, 49984812, 70669026, 99219168, 140263652, 196992898, 278461677 

Although some partition statistics of $\lambda^{(k)}$ have easily 
derived closed forms, the Durfee square does not seem 
to be one of them. Moreover, numerical checks rule out 
a recurrence of the form 

$$L(n) = \sum_i \sum_j a_{ij} n^i L(n - j),$$

for moderate ranges of i and j, and the OEIS Superseeker 
[3] does not shed any light on a possible generating 
function for $L(n)$ or for $L(4n)$. 

The limit of the author's current knowledge on the asymptotic 
behaviour is that $L(n)$ is within a sub-exponential 
factor of $2^{n/2}$. The upper bound is easy: $\lambda^{(n)}$ 
is proven by induction to be a partition of $2^{n-1}$, and 
its Durfee square is thus bounded above by $2^{(n-1)/2}$. 
For the lower bound, a simple stone-throwing strategy 
due to Zeng [5] suffices: assign each player $n/2$ 
stones (with suitable rounding according to the value of 
$n \bmod 4$) to be split between $n/4$ throws (ditto), and 
the well-known asymptotic behaviour of the central binomial 
coefficients gives a suitable result. For example, 
for $n \equiv 0 \pmod 4$ it gives a lower bound of 

$$\left( 1 - \frac{1}{2n} \right) \frac{2^{n/2}}{\sqrt{\pi n}}.$$

In summary, the lapidary numbers provide an unusual 
application of partitions which is useful computational- 
ly, although the Durfee square of a sum of two partitions 
does not seem amenable to analysis. 

1. It is not mentioned in the chapter on partitions in Dickson's history [4], nor 
in Andrews' and Eriksson's more recent overview [6]. 

2. Personal correspondence. 
Hopf Fibration 

The Hopf fibration describes a 3-sphere in 
terms of circles and an ordinary sphere. 
A New Variant of the Josephus 
Problem and Its Solution 

Anuradha Mahasinghe & Mohamed Sabri 

Department of Mathematics, University of Colombo, Sri Lanka 


Flavius Josephus, the first-century Jewish scholar, probably 
knew of a secret formula for being the last person in 
a circle, when every 3rd person was counted and assassinated. 
Josephus’ secret formula has inspired a number 
of mathematicians throughout the ages. Interestingly, 
Sri Lankan rural children have long played a traditional 
game, from which a much generalised version of the Josephus 
problem can be derived. Here we take a close look at 
this folk game, formulate this new variant of the Josephus 
problem, and derive its solution. 

Aturu-mituru: A Sri Lankan 
Folk Game 

Two decades ago, you likely would not find a single 
household in Sri Lanka where aturu-mituru was not 
played. In it, the players sit in a circle with their hands on 
the floor facing down. One player recites the verse “aturu 
mituru dambadiva turu raja kapuru settiyd...” touching 
the hands of every player, one by one. The player on 
whose hand the verse halts according to its last syllable 
takes his hand off, and the game continues by repeating 
the same recitation from the hand immediately next to 
that. Regional variants of the game differ in the verse 
and in the exact place the verse halts. According to one version 
we remember, the syllable and the rhyme imply that 
every 8th, 8th, 8th and 3rd hands are eliminated during 
one recitation of the verse. The winner is the player whose 
hand remains at the end. 

This version would be the problem widely known as the 
Josephus problem, were it not for the varying eliminating indices 
8, 8, 8 and 3. The Josephus problem is named after Flavius 
Josephus, the author of famous Jewish-Roman war 
records. He was the commander of the Galilee Jewish 
force who fought against the Romans during the first 
Jewish-Roman war. It was Josephus’ surrendering to the 
Romans which originated the mathematical puzzle. It is 
said that 41 Jews, including him, had to withdraw to a 
cave to escape the Romans. Knowing the Romans 
would find them, and preferring honour over captivity, they 
unanimously decided to kill themselves. People were arranged 
in a circle, and a count was taken from a certain 
position so that every third person was assassinated. The 
count is said to have continued until Josephus and one 
other person were left alive. 

Thus, we may state the Josephus problem as follows: 


“given a circular arrangement of m people, from which 
every rth person is eliminated until only one person remains, 
what would be the position of the survivor?” 

It seems that the Josephus problem was more or less universal 
in the medieval period. Variants of the problem are 
recorded from different regions. One may find a handful 
of such regional variants in [1]. Apart from these folklore-based 
variants, mathematical curiosity has created 
some other generalisations and extensions of the problem 
such as the linear [2], the extended [3], and the feline [4] 
Josephus problems. It should be noted that the 
eliminating index is assumed to be constant in all of these 
versions. 

The New Josephus problem 

Inspired by the relevant version of the traditional game 
aturu-mituru, we present a new variant of the Josephus 
problem, in which the eliminating index is not necessarily 
constant throughout the game. Let us call it the new 
Josephus problem. Accordingly, the problem statement of 
the new Josephus problem is as follows: 

“given a circular arrangement of m people for which the 
eliminating index in the kth round is given by $a_k$, what is 
the position of the survivor?” 

Before solving the new Josephus problem, let us clarify 
how exactly it would be mapped into the game of aturu-mituru. 
Suppose 7 kids are taking part in the game. 
That is, there are 14 hands (m = 14), yet the elimination 
index ($a_k$) is taken in the order 8, 8, 8, 3. We may consider this 
as a subsequence of the 14-termed sequence in which 
every $4r$th ($r = 1, 2, \text{or } 3$) term is 3 and every other term 
is 8. Accordingly, in the new Josephus problem, whenever 
the elimination sequence consists of fewer terms than the 
number of people (m), we may form the corresponding 
m-termed supersequence. On the other hand, the number 
of rounds cannot exceed the number of people. Considering 
these two facts, we set the number of terms in our 
sequence $\{a_k\}$ to m. 
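
As an illustration, the m-termed supersequence for the aturu-mituru pattern can be built by cycling through the verse pattern. This is only a sketch; the function name elimination_sequence is ours and does not appear in the original formulation.

```python
from itertools import cycle, islice


def elimination_sequence(pattern, m):
    """Repeat the pattern cyclically until the sequence has m terms."""
    return list(islice(cycle(pattern), m))


# 7 children, i.e. m = 14 hands, with the verse pattern 8, 8, 8, 3.
print(elimination_sequence([8, 8, 8, 3], 14))
# [8, 8, 8, 3, 8, 8, 8, 3, 8, 8, 8, 3, 8, 8]
```

Every 4th, 8th and 12th term is 3, and all the other terms are 8.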

There is a trivial case of the new Josephus problem. That 
is, when $a_k = m - k + 1$, the survivor would be the 
person from whom the count starts. On the other hand, 
the case $a_k = r$ for all k (that is, r is the constant eliminating 
index), results in the original Josephus problem. 

Similar to the ordinary Josephus problem, the main task 
of the new Josephus problem is to find the safe position in 
the circle. Accordingly, we present a recursive formula to 
find this safe position, and later apply it to find the initial 
position of the person eliminated in the kth round, which 
had been an additional concern regarding the ordinary 
Josephus problem. 

The Solution of the New Jose- 
phus Problem 

It is quite natural that the person from whom the count- 
ing starts is labelled as the first person in the circle. The 
final goal is to predict the safe position with respect to 
this initial labelling. Having said that, however, in our 
solution procedure, we adopt a slightly different method 
of labelling, which changes with the round. 

Imagine a round starting with n people (n ≤ m) remaining 
in the circle. The count is started from some person 
in this group of n persons. We assign the label 
1 to that person, and the other persons are labelled 
2, 3, ..., n in the order in which the count is 
taken. At the next round, when n - 1 people are left, 
we renew our labelling: label 1 is now given to the person 
from whom we start the new round. With respect to this 
labelling, what would be the position of the survivor? 

Let us denote the survivor’s relative position by f(n). That 
is, whenever there are n people in the circle and the count 
in this round is taken from the first person to the n-th 
person, the survivor is the person currently at the f(n)-th 
position, if the game is continued until only one 
person remains. 

But of course it does not matter how many people remain 
in the circle! If the eliminations are continued, we 
end up with a unique survivor. This survivor's position is 
assigned two different labels at two consecutive rounds, 
namely f(n) and f(n - 1). It can be shown that these 
two labels are related to each other as follows: 

f(n) = (f(n - 1) + a_{m-n+1}) mod n.    (1) 

The verification of this formula is not difficult, yet not 
straightforward either. Our argument is related to that 
for the ordinary Josephus problem from [5]. 

If n people remain in the circle, it must be the (m - n 
+ 1)st round which is about to start. Let the count go 
on. Then the person at position a_{m-n+1} mod n (with 
respect to our relabelling, reading a residue of 0 as 
position n) will be eliminated. The next 
round starts from the person next to him. That is, the 
person whose position will be 1 in the next, (m - n 
+ 2)nd, round is at the (a_{m-n+1} mod n + 1)st position 
in our current labelling. Likewise, the mapping in Table 1 
can be easily verified. 

Therefore, the person at position k in the (m - n 
+ 2)nd round was at position (k + a_{m-n+1} mod n) 
mod n in the previous round. Obviously, the survivor is 
no exception. Thus, Equation (1) is readily verified. 


This enables us to predict the safe position with respect to 
the initial labelling, when combined with the end condition 
f(1) = 1 and evaluated up to n = m. 


Table 1 

(m - n + 2)nd round           (m - n + 1)st round 

1                             a_{m-n+1} mod n + 1 
2                             a_{m-n+1} mod n + 2 
...                           ... 
n - a_{m-n+1} mod n           n 
n - a_{m-n+1} mod n + 1       1 
n - a_{m-n+1} mod n + 2       2 
...                           ... 
n - 1                         a_{m-n+1} mod n - 1 
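
Equation (1), together with the end condition f(1) = 1, translates directly into a loop. The sketch below is one possible reading, assuming the sequence is stored as a Python list with a[k-1] holding a_k and that a residue of 0 stands for position n; the name safe_position is ours.

```python
def safe_position(a):
    """Survivor's initial position via f(n) = (f(n-1) + a_{m-n+1}) mod n.

    a[k-1] holds a_k and m = len(a). Labels run from 1 to n, so a
    residue of 0 is read as position n.
    """
    m = len(a)
    f = 1  # end condition f(1) = 1
    for n in range(2, m + 1):
        f = (f + a[m - n]) % n  # a_{m-n+1} is a[m-n] with 0-based indexing
        if f == 0:
            f = n
    return f


# The 14-hand example with eliminating indices 8, 8, 8, 3 repeated.
print(safe_position([8, 8, 8, 3, 8, 8, 8, 3, 8, 8, 8, 3, 8, 8]))  # -> 12
```

For the constant sequence a_k = 3 with m = 7 the same function returns 4, the familiar answer for the ordinary Josephus problem with 7 people and every third person removed.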


The k-th Elimination in the New Josephus Problem 

Finding the position of the person who is eliminated in 
the k-th round of the Josephus circle has also been considered 
in the literature [6]. The argument we used above 
can be adapted to find this position in the new 
Josephus problem as well. 

That is, we consider the (m - n + 1)st round, where n 
people remain in the circle. Consider the person who is 
eliminated in the k-th round counting from now, the current 
round being the first. We denote 
this person's position by g(n, k). Now, the same person can be 
regarded as the one who will be eliminated in the (k - 1)-th 
round counting from the next round. In other words, the person 
to whom the label g(n, k) was assigned in the (m 
- n + 1)st round is assigned the new label g(n - 1, 
k - 1) in the (m - n + 2)nd round. Using a mapping 
similar to the one in Table 1, it can be readily verified that 
the equations 

g(n, k) = (g(n - 1, k - 1) + a_{m-n+1}) mod n 

g(n, 1) = a_{m-n+1} mod n 

provide us with a nice way of finding the initial position of the 
person who will be eliminated in the k-th round of the new 
Josephus problem. 
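
The recursion for g can be unrolled from its base case upwards. The following sketch is a hedged illustration, assuming as before that a[k-1] holds a_k and that a residue of 0 is read as position n; the function name eliminated_position is ours.

```python
def eliminated_position(a, k):
    """Initial position g(m, k) of the person eliminated in round k.

    a[k-1] holds a_k, m = len(a), and 1 <= k <= m - 1.
    """
    m = len(a)
    # Base case g(m - k + 1, 1) = a_k mod (m - k + 1); a residue of 0 means n.
    n = m - k + 1
    g = a[k - 1] % n or n
    # Climb back to n = m with g(n, j) = (g(n - 1, j - 1) + a_{m-n+1}) mod n.
    for n in range(m - k + 2, m + 1):
        g = (g + a[m - n]) % n or n
    return g


a = [3] * 7  # ordinary Josephus circle: 7 people, every third removed
print([eliminated_position(a, k) for k in range(1, 7)])  # -> [3, 6, 2, 7, 5, 1]
```

Calling the function with k = m starts from position 1 in a circle of one person and then follows the same steps as Equation (1), so it returns the survivor (4 in this example).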
Conquer or be Conquered 

Jack Hurst 


Rules 

Look at the hexagonal grid below, which has two types of cells - clues (black) and answers (empty). Your job is to use the 
clues and follow these rules to fill in all of the answers. 

• Every answer must be a whole number between 1 and 50 (inclusive) and no number appears more than once. Since 
there are fewer than 50 answers, not every number from 1 to 50 will appear, and the smallest and largest number 
that appear may not be 1 and 50 respectively. 

• Each clue is in reference to all of the answers adjacent to it, i.e. its empty neighbours. Some clues may reference 
other clues. When referencing a clue cell in another clue you will see it in bold underline like A. 

• For example, if clue A is “Sum is 36. All prime” this means that all of the answers adjacent to clue A sum to 36 
and that they are all prime numbers. 

• Then if clue B is “Sum is 2 or 3 times A”, this means that the answers adjacent to clue B sum to 2 or 3 times 
whatever the sum of the answers adjacent to A is. In this case we know this is 36, so the answers adjacent to B 
add up to either 72 or 108. 



Clues 


A. All primes, and all 1 away from a multiple of 6. 

B. Sum to 7 more than J. 

C. Only use the digits 1 and 3. 

D. The smallest and largest answer in the grid and 
their average. The largest is a prime. 

E. Sum to a multiple of 23. All but 1 are 1 away from 
a square. 

F. Two numbers and their sum. One of the smaller 
numbers is a multiple of 10. 

G. Sum is twice the sum of I. Sum is 11 times greater 
than the smallest at G. 

H. Sum is 77. No primes. 

I. Product is a square which ends in 600. 

J. All divisible by 3 and sum ends in 1. 

K. All squares. 

L. Product ends in 0. All smaller than the smallest 
at G. 

M. Sum is a power of 2. 


Warning: This puzzle is difficult! To complete it you will need to be good at maths and very persistent. You will need pen 
and paper to write out your workings. A calculator or a computer is not necessary, and you are discouraged from using 
either. Knowing about prime factorisation is essential, and knowledge of modular arithmetic is helpful. 


First printed at http://jackhurst.co.uk/blog/ 
Buddhabrot 

The Buddhabrot is a fractal rendering tech- 
nique related to the Mandelbrot set. Its name 
reflects its resemblance to classical depictions 
of Gautama Buddha. 
Solutions to the Integration Contest 


1. ^ cos(5x) - cos(13x) 

2. x cosh^{-1} x - sqrt(x^2 - 1) 

3. 2 

4. 0 

5. 2 sqrt(1 + sin x) 


6. f 

7. — cot(|) — x 

8. -cot(sin^{-1} x) - sin^{-1} x 

9 , a 

7 * sinha 

10. 

11. -csc x log(sin x) + csc x 

12. x(logx) 2 

e x(lQgx-l) 

I O . log x 

14. (1/5) tan^5 x + (2/3) tan^3 x + tan x 

15. 1 

16. 5 
1 7. eyT 

18. log(3x^5 - 5x^3 + 4x^2) 

19-+I 

20. (sec x)^(sec x) 
Join The Archimedeans 